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Abstract 

The theory of gravitational lensing is reviewed from a spacetime perspective, without 
quasi-Newtonian approximations. More precisely, the review covers all aspects of gravita- 
tional lensing where light propagation is described in terms of lightlike geodesies of a metric 
of Lorentzian signature. It includes the basic equations and the relevant techniques for calcu- 
lating the position, the shape, and the brightness of images in an arbitrary general-relativistic 
spacetime. It also includes general theorems on the classification of caustics, on criteria for 
multiple imaging, and on the possible number of images. The general results are illustrated 
with examples of spacetimes where the lensing features can be explicitly calculated, including 
the Schwarzschild spacetime, the Kerr spacetime, the spacetime of a straight string, plane 
gravitational waves, and others. 
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1 Introduction 



In its most general sense, gravitational lensing is a collective term for all effects of a gravitational 
field on the propagation of electromagnetic radiation, with the latter usually described in terms 
of rays. According to general relativity, the gravitational field is coded in a metric of Lorentzian 
signature on the 4-dimensional spacetime manifold, and the light rays are the lightlike geodesies 
of this spacetime metric. From a mathematical point of view, the theory of gravitational lensing 
is thus the theory of lightlike geodesies in a 4-dimensional manifold with a Lorentzian metric. 

The first observation of a 'gravitational lensing' effect was made when the deflection of star 
light by our Sun was verified during a Solar eclipse in 1919. Today, the list of observed phenomena 
includes the following: 

Multiple quasars. 

The gravitational field of a galaxy (or a cluster of galaxies) bends the light from a distant quasar 
in such a way that the observer on Earth sees two or more images of the quasar. 

Rings. 

An extended light source, like a galaxy or a lobe of a galaxy, is distorted into a closed or al- 
most closed ring by the gravitational field of an intervening galaxy. This phenomenon occurs in 
situations where the gravitational field is almost rotationally symmetric, with observer and light 
source close to the axis of symmetry. It is observed primarily, but not exclusively, in the radio range. 

Arcs. 

Distant galaxies are distorted into arcs by the gravitational field of an intervening cluster of galax- 
ies. Here the situation is less symmetric than in the case of rings. The effect is observed in the 
optical range and may produce "giant luminous arcs" , typically of a characteristic blue color. 

Microlensing. 

When a light source passes behind a compact mass, the focusing effect on the light leads to a 
temporal change in brightness (energy flux). This microlensing effect is routinely observed since 
the early 1990s by monitoring a large number of stars in the bulge of our Galaxy, in the Magellanic 
Clouds and in the Andromeda galaxy. Microlensing has also been observed on quasars. 

Image distortion by weak lensing. 

In cases where the distortion effect on galaxies is too weak for producing rings or arcs, it can be ver- 
ified with statistical methods. By evaluating the shape of a large number of background galaxies in 
the field of a galaxy cluster, one can determine the surface mass density of the cluster. By evaluat- 
ing fields without a foreground cluster one gets information about the large-scale mass distribution. 

Observational aspects of gravitational lensing and methods of how to use lensing as a tool in 
astrophysics are the subject of the Living Review by Wambsganss |427j . There the reader may 
also find some notes on the history of lensing. 

The present review is meant as complementary to the review by Wambsganss. While all the 
theoretical methods reviewed in |427j rely on quasi-Newtonian approximations, the present review 
is devoted to the theory of gravitational lensing from a spacetime perspective, without such approx- 
imations. Here the terminology is as follows: "Lensing from a spacetime perspective" means that 
light propagation is described in terms of lightlike geodesies of a general-relativistic spacetime met- 
ric, without further approximations. (The term "non-perturbative lensing" is sometimes used in 
the same sense.) "Quasi-Newtonian approximation" means that the general-relativistic spacetime 
formalism is reduced by approximative assumptions to essentially Newtonian terms (Newtonian 



2 



space, Newtonian time, Newtonian gravitational field). The quasi-Newtonian approximation for- 
malism of lensing comes in several variants, and the relation to the exact formalism is not always 
evident because sometimes plausibility and ad-hoc assumptions are implicitly made. A common 
feature of all variants is that they are "weak-field approximations" in the sense that the spacetime 
metric is decomposed into a background ( "spacetime without the lens" ) and a small perturbation 
of this background ("gravitational field of the lens"). For the background one usually chooses 
either Minkowski spacetime (isolated lens) or a spatially fiat Robertson-Walker spacetime (lens 
embedded in a cosmological model). The background then defines a Euclidean 3-space, similar to 
Newtonian space, and the gravitational field of the lens is similar to a Newtonian gravitational 
field on this Euclidean 3-space. Treating the lens as a small perturbation of the background means 
that the gravitational field of the lens is weak and causes only a small deviation of the light rays 
from the straight lines in Euclidean 3-space. In its most traditional version, the formalism as- 
sumes in addition that the lens is "thin" , and that the lens and the light sources are at rest in 
Euclidean 3-space, but there are also variants for "thick" and moving lenses. Also, modifications 
for a spatially curved Robertson-Walker background exist, but in all variants a non-trivial topo- 
logical or causal structure of spacetime is (explicitly or implicitly) excluded. At the center of the 
quasi- Newtonian formalism is a "lens equation" or "lens map" , which relates the position of a 
"lensed image" to the position of the corresponding "unlensed image". In the most traditional 
version one considers a thin lens at rest, modeled by a Newtonian gravitational potential given 
on a plane in Euclidean 3-space ("lens plane"). The light rays are taken to be straight lines in 
Euclidean 3-space except for a sharp bend at the lens plane. For a fixed observer and light sources 
distributed on a plane parallel to the lens plane ( "source plane" ) , the lens map is then a map from 
the lens plane to the source plane. In this way, the geometric spacetime setting of general relativity 
is completely covered behind a curtain of approximations, and one is left simply with a map from 
a plane to a plane. Details of the quasi-Newtonian approximation formalism can be found not only 
in the above-mentioned Living Review ^427., but also in the monographs of Schneider, Ehlers, and 
Falco |367] and Fetters, Levine, and Wambsganss [343]. 

The quasi-Newtonian approximation formalism has proven very successful for using gravita- 
tional lensing as a tool in astrophysics. This is impressively demonstrated by the work reviewed 
in [427] . On the other hand, studying lensing from a spacetime perspective is of relevance under 
three aspects: 

Didactical. 

The theoretical foundations of lensing can be properly formulated only in terms of the full formal- 
ism of general relativity. Working out examples with strong curvature and with non-trivial causal 
or topological structure demonstrates that, in principle, lensing situations can be much more com- 
plicated than suggested by the quasi-Newtonian formalism. 

Methodological. 

General theorems on lensing (e.g., criteria for multiple imaging, characterizations of caustics, etc.) 
should be formulated within the exact spacetime setting of general relativity, if possible, to make 
sure that they are not just an artifact of approximative assumptions. For those results which do 
not hold in arbitrary spacetimes, one should try to find the precise conditions on the spacetime 
under which they are true. 

Practical. 

There are some situations of astrophysical interest to which the quasi-Newtonian formalism does 
not apply. For instance, near a black hole light rays are so strongly bent that, in principle, they can 
make arbitrarily many turns around the hole. Clearly, in this situation it is impossible to use the 
quasi-Newtonian formalism which would treat these light rays as small perturbations of straight 
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lines. 



The present review tries to elucidate all three aspects. More precisely, the following subjects 
will be covered: 

• The basic equations and all relevant techniques that are needed for calculating the position, 
the shape, and the brightness of images in an arbitrary general-relativistic spacetime are re- 
viewed. Part of this material is well-established since decades, like the Sachs equations for the 
optical scalars (Section 12. 3|) . which are of crucial relevance for calculating distance measures 
(Section 12.41) . image distortion (Section 12.51) . and the brightness of images (Section 12. 6p . It 
is included here to keep the review self-contained. Other parts refer to more recent devel- 
opments which are far from being fully explored, like the exact lens map (Section 12. 1[) and 
variational techniques (Section l2.9|) . Specifications and simplifications are possible for space- 
times with symmetries. The case of spherically symmetric and static spacetimes is treated 
in greater detail (Section 14. 3|) . 

• General theorems on lensing in arbitrary spacetimes, or in certain classes of spacetimes, are 
reviewed. Some of these results are of a local character, like the classification of locally 
stable caustics (Section l2.2p . Others are related to global aspects, like the criteria for multi- 
ple imaging in terms of conjugate points and cut points (Sections 12.71 and 12. 8|) . The global 
theorems can be considerably strengthened if one restricts to globally hyperbolic spacetimes 
(Section 13. 1|) or, more specifically, to asymptotically simple and empty spacetimes (Sec- 
tion |331)- The latter may be viewed as spacetime models for isolated transparent lenses. 
Also, in globally hyperbolic spacetimes Morse theory can be used for investigating whether 
the total number of images is finite or infinite, even or odd (Section 13. 3|) . In a spherically 
symmetric and static spacetime, the occurrence of an infinite sequence of images is related 
to the occurrence of a "light sphere" (circular lightlike geodesies), like in the Schwarzschild 
spacetime at r = 3m (Section 14.31) . 

• Several examples of spacetimes are considered, where the lightlike geodesies and, thus, the 
lensing features can be calculated explicitly. The examples are chosen such that they illustrate 
the general results. Therefore, in many parts of the review the reader will find suggestions to 
look at pictures in the example section. The best known and astrophysically most relevant 
examples are the Schwarzschild spacetime (Section 15. ip , the Kerr spacetime (Section 15. 8p 
and the spacetime of a straight string (Section I5.10p . Schwarzschild black hole lensing and 
Kerr black hole lensing was intensively investigated already in the 1960s, 1970s, and 1980s, 
with astrophysical applications concentrating on observable features of accretion disks. More 
recently, the increasing evidence that there is a black hole at the center of our Galaxy (and 
probably at the center of most galaxies) has led to renewed and intensified interest in black 
hole lensing (see Sections 15.11 and 15 .8p . This is a major reason for the increasing number of 
articles on lensing beyond the quasi-Newtonian approximation. 

This introduction ends with some notes on subjects not covered in this review: 
Wave optics. 

In the electromagnetic theory, light is described by wavelike solutions to Maxwell's equations. The 
ray-optical treatment used throughout this review is the standard high-frequency approximation 
(geometric optics approximation) of the electromagnetic theory for light propagation in vacuum 
on a general-relativistic spacetime (see, e.g., |279] . § 22.5 or |367] , Section 3.2). (Other notions 
of vacuum light rays, based on a different approximation procedure, have been occasionally sug- 
gested |271j . but will not be considered here. Also, results specific to spacetime dimensions other 
than four or to gravitational theories other than Einstein's are not covered.) For most applications 
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to lensing the ray-optical treatment is valid and appropriate. An exception, where wave-optical 
corrections are necessary, is the calculation of the brightness of images if a light source comes very 
close to the caustic of the observer's light cone (see Section [^^ . 

Light propagation in matter. 

If light is directly influenced by a medium, the light rays are no longer the lightlike geodesies of 
the spacetime metric. For an isotropic non-dispersive medium, they are the lightlike geodesies of 
another metric which is again of Lorentzian signature. (This "optical metric" was introduced by 
Gordon jl79j . For a rigourous derivation, starting from Maxwell's equation in an isotropic non- 
dispersive medium, see Ehlers [114j .) Hence, the formalism used throughout this review still applies 
to this situation after an appropriate re-interpretation of the metric. In anisotropic or dispersive 
media, however, the light rays are not the lightlike geodesies of a Lorentzian metric. There are 
some lensing situations where the influence of matter has to be taken into account. For instance., 
for the deflection of radio signals by our Sun the influence of the plasma in the Solar corona (to 
be treated as a dispersive medium) is very well measurable. However, such situations will not 
be considered in this review. For light propagation in media on a general-relativistic spacetime, 
see |337j and references cited therein. 

Kinetic theory. 

As an alternative to the (geometric optics approximation of) electromagnetic theory, light can be 
treated as a photon gas, using the formalism of kinetic theory. This has relevance, e.g., for the 
cosmic background radiation. For basic notions of general-relativistic kinetic theory see, e.g., [115j . 
Apart from some occasional remarks, kinetic theory will not be considered in this review. 

Derivation of the quasi-Newtonian formaUsm. 

It is not satisfacory if the quasi-Newtonian formalism of lensing is set up with the help of ad-hoc 
assumptions, even if the latter look plausible. From a methodological point of view, it is more 
desirable to start from the exact spacetime setting of general relativity and to derive the quasi- 
Newtonian lens equation by a well-defined approximation procedure. In comparison to earlier such 
derivations [3671 13621 1373) later effort has led to considerable improvements. For lenses embedded 
in a cosmological model, see Pyne and Birkinshaw |352j who consider lenses that need not be thin 
and may be moving on a Robertson-Walker background (with positive, negative, or zero spatial 
curvature). For the non-cosmological situation, a Lorentz covariant approximation formalism was 
derived by Kopeikin and Schafer [238] . Here Minkowski spacetime is taken as the background, and 
again the lenses need not be thin and may be moving. 
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2 Lensing in Arbitrary Spacetimes 



By a spacetime we mean a 4-dimensional manifold M with a (C°°, if not otherwise stated) metric 
tensor field g of signature (+,+,+,—) that is time-oriented. The latter means that the non- 
spacelike vectors make up two connected components in the entire tangent bundle, one of which is 
called "future-pointing" and the other one "past-pointing" . Throughout this review we restrict to 
the case that the light rays are freely propagating in vacuum, i.e., are not influenced by mirrors, 
refractive media, or any other impediments. The light rays are then the lightlike geodesies of 
the spacetime metric. We first summarize results on the lightlike geodesies that hold in arbitrary 
spacetimes. In Section |3] these results will be specified for spacetimes with conditions on the causal 
structure and in Section |4] for spacetimes with symmetries. 

2.1 Light cone and exact lens map 

In an arbitrary spacetime (M^g), what an observer at an event po can see is determined by the 
lightlike geodesies that issue from po into the past. Their union gives the past light cone of po- 
This is the central geometric object for lensing from the spacetime perspective. For a point source 
with worldline 7s, each past-oriented lightlike geodesic A from po to 73 gives rise to an image of 
7s on the observer's sky. One should view any such A as the central ray of a thin bundle that is 
focused by the observer's eye lens onto the observer's retina (or by a telescope onto a photographic 
plate). Hence, the intersection of the past light cone with the world-line of a point source (or 
with the world-tube of an extended source) determines the visual appearance of the latter on the 
observer's sky. 

In mathematical terms, the observer's sky or celestial sphere So can be viewed as the set of all 
lightlike directions at po- Every such direction defines a unique (up to parametrization) lightlike 
geodesic through po, so So may also be viewed as a subset of the space of all lightlike geodesies in 
{M,g) (cf. [263) 1. One may choose at po a future-pointing vector Uo with g{Uo,Uo) = —1, to be 
interpreted as the 4-velocity of the observer. This allows identifying the observer's sky So with a 
subset of the tangent space Tp^Ai, 

So ^ {w eTp^M\g{w,w) =0 and g{w,Uo) ^l} ■ (1) 

If Uo is changed, this representation changes according to the standard aberration formula of 
special relativity. By definition of the exponential map exp, every affinely parametrized geodesic 
s H> A(s) satisfies A(s) = cxp(s A(0)). Thus, the past light cone of po is the image of the map 

{s,w) I — > exp(sK;), (2) 

which is defined on a subset of ]0, oo[x5o. If wc restrict to values of s sufficiently close to 0, the 
map ^ is an embedding, i.e., this truncated light cone is an embedded submanifold; this follows 
from the well-known fact that exp maps a neighborhood of the origin, in each tangent space, 
diffeomorphically into the manifold. However, if we extend the map ^ to larger values of s, it is 
in general neither injective nor an immersion; it may form folds, cusps, and other forms of caustics, 
or transverse self-intersections. This observation is of crucial importance in view of lensing. There 
are some lensing phenomena, such as multiple imaging and image distortion of (point) sources into 
(1-dimensional) rings, which can occur only if the light cone fails to be an embedded submanifold 
(see Section [2.8p . Such lensing phenomena are summarized under the name strong lensing effects. 
As long as the light cone is an embedded submanifold, the effects exerted by the gravitational 
field on the apparent shape and on the apparent brightness of light sources are called weak lensing 
effects. For examples of light cones with caustics and/or transverse self-intersections, see FigureslTSl 
1251 and 1261 These pictures show light cones in spacetimes with symmetries, so their structure is 
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rather regular. A realistic model of our own light cone, in the real world, would have to take into 
account numerous irregularly distributed inhomogeneities ( "clumps" ) that bend light rays in their 
neighborhood. Ellis, Bassett, and Dunsby [129] estimate that such a light cone would have at least 
10^^ caustics which are hierarchically structured in a way that reminds of fractals. 

For calculations it is recommendable to introduce coordinates on the observer's past light 
cone. This can be done by choosing an orthonormal tetrad (69,61,62,63) with 60 = —Uq at the 
observation event po- This parametrizes the points of the observer's celestial sphere by spherical 
coordinates 

w = sin Q cos ^' 61 + sin 9 sin 62 + cos 63 + 60. (3) 

In this representation, map ([2]) maps each (s, ^ff, Q) to a spacetime point. Letting the observation 
event float along the observer's worldline, parametrized by proper time r, gives a map that assigns 
to each (s, 5*, 9, r) a spacetime point. In terms of coordinates x — (x", a;^, x^, x^) on the spacetime 
manifold, this map is of the form 

x' ^ F'{s,^,Q,t), i = 0,l,2,3. (4) 

It can be viewed as a map from the world as it appears to the observer (via optical observations) 
to the world as it is. The coordinates (s,\I',9,t) were called optical coordinates by Temple |401j 
and observational coordinates by Ellis jl28j . A detailed discussion of their properties can be found 
in |130) . They are particularly useful in cosmology but can be introduced for any observer in 
any spacetime. It is useful to consider observables, such as distance measures (see Section [Z4| or 
the ellipticity that describes image distortion (see Section 12. Sp as functions of the observational 
coordinates. Some observables, e.g., the redshift and the luminosity distance, are not determined 
by the spacetime geometry and the observer alone, but also depend on the 4-velocities of the light 
sources. If a vector field U with g{U, U) = —1 has been fixed, one may restrict to an observer and 
to light sources which are integral curves of U. The above-mentioned observables, like redshift and 
luminosity distance, are then uniquely determined as functions of the observational coordinates. 
In applications to cosmology one chooses U as tracing the mean flow of luminous matter ( "Hubble 
flow" ) or as the rest system of the cosmic background radiation; present observations are compatible 
with the assumption that these two distinguished observer fields coincide [15] 1871 [2l3J . 

Writing map ^ explicitly requires solving the lightlike geodesic equation. This is usually done, 
using standard index notation, in the Lagrangian formalism, with the Lagrangian C = ^gij{x)x'^x^ , 
or in the Hamiltonian formalism, with the Hamiltonian H — ^g"^^ {x)piPj. A non-trivial example 
where the solutions can be explicitly written in terms of elementary functions is the string spacetime 
of Section 15.101 Somewhat more general, although still very special, is the situation that the 
lightlike geodesic equation admits three independent constants of motion in addition to the obvious 
one g^^{x)piPj = 0. If, for any pair of the four constants of motion, the Poisson bracket vanishes 
("complete integrability" ) , the lightlike geodesic equation can be reduced to first-order form, i.e., 
the light cone can be written in terms of integrals over the metric coefficients. This is true, e.g., 
in spherically symmetric and static spacetimes (see Section l473l) . 

Having parametrized the past light cone of the observation event po in terms of {s^w)^ or 
more specifically in terms of (s, ^, 9), one may set up an exact lens map. This exact lens map is 
analogous to the lens map of the quasi-Newtonian approximation formalism, as far as possible, but 
it is valid in an arbitrary spacetime without approximation. In the quasi-Newtonian formalism for 
thin lenses at rest, the lens map assigns to each point in the lens plane a point in the source plane 
(see, e.g., [367| 13431 1427J ). When working in an arbitrary spacetime without approximations, the 
observer's sky So is an obvious substitute for the lens plane. As a substitute for the source plane 
we choose a 3-dimensional submanifold T with a prescribed ruling by timelike curves. We assume 
that T is globally of the form Q x E, where the points of the 2-manifold Q label the timelike curves 
by which T is ruled. These timelike curves are to be interpreted as the worldlines of light sources. 
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We call any such T a source surface. In a nutshell, choosing a source surface means choosing a 
two-parameter family of light sources. 

The exact lens map is a map from So to Q. It is defined by following, for each w € So, the past- 
pointing geodesic with initial vector w until it meets T and then projecting to Q (see Figure [T]). 
In other words, the exact lens map says, for each point on the observer's celestial sphere, which of 
the chosen light sources is seen at this point. Clearly, non-invertibility of the lens map indicates 
multiple imaging. What one chooses for T depends on the situation. In applications to cosmology, 
one may choose galaxies at a fixed redshift z ~ around the observer. In a spherically-symmetric 
and static spacetime one may choose static light sources at a fixed radius value r — rg. Also, the 
surface of an extended light source is a possible choice for T. 




Q 



Figure 1: Illustration of the exact lens map. po is the chosen observation event, T is the chosen 
source surface. T is a hypersurface ruled by timelike curves (worldlines of light sources) which are 
labeled by the points of a 2-dimensional manifold Q. The lens map is defined on the observer's 
celestial sphere So, given by Equation ([T|), and takes values in Q. For each w e So, one follows the 
lightlike geodesic with this initial direction until it meets T and then projects to Q. For illustrating 
the exact lens map, it is an instructive exercise to intersect the light cones of Figures [131 HSl [26l 
and |30] with various source surfaces T. 

The exact lens map was introduced by Frittelli and Newman |154j and further discussed in |1171 
1116) . The following global aspects of the exact lens map were investigated in [338]. First, in general 
the lens map is not defined on all of So because not all past-oriented lightlike geodesies that start 
at Po necessarily meet T. Second, in general the lens map is multi-valued because a lightlike 
geodesic might meet T several times. Third, the lens map need not be differentiable and not even 
continuous because a lightlike geodesic might meet T tangentially. In [338] , the notion of a simple 
lensing neighborhood is introduced which translates the statement that a deflector is transparent 
into precise mathematical language. It is shown that the lens map is globally well-defined and 
differentiable if the source surface is the boundary of such a simple lensing neighborhood, and that 
for each light source that does not meet the caustic of the observer's past light cone the number of 
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images is finite and odd. This result applies, as a special case, to asymptotically simple and empty 
spacetimes (see Section . 

For expressing the exact lens map in coordinate language, it is recommendable to choose co- 
ordinates {x'^ , , x'^ , x^) such that the source surface T is given by the equation x'^ = ccg, with a 
constant Xg , and that the worldlines of the light sources are x^-lines. In this situation the remaining 
coordinates x^ and x'^ label the light sources and the exact lens map takes the form 

iM>,Q)^{x\x'). (5) 

It is given by eliminating the two variables s and x*^ from the four equations (|4]) with F^{s, 5*, 9, r) = 
Xg and fixed r. This is the way in which the lens map was written in the original paper by Frittelli 
and Newman; see Equation (6) in (154) . (They used complex coordinates (77,77) for the observer's 
celestial sphere that are related to our spherical coordinates &) by stereographic projection.) 
In this explicit coordinate version, the exact lens map can be succesfully applied, in particular, 
to spherically symmetric and static spacetimes, with = i, x^ = ip, x^ = "d, and x^ = r (see 
Section 14.31 and the Schwarzschild example in Section 15. ip . The exact lens map can also be used 
for testing the reliability of approximation techniques. In |237j the authors find that the standard 
quasi-Newtonian approximation formalism may lead to significant errors for lensing configurations 
with two lenses. 

2.2 Wave fronts 

Wave fronts are related to light rays as solutions of the Hamilton-Jacobi equation are related to 
solutions of Hamilton's equations in classical mechanics. For the case at hand (i.e., vacuum light 
propagation in an arbitrary spacetime, corresponding to the Hamiltonian 7i — \g^-' {x)piPj), a wave 
front is a subset of the spacetime that can be constructed in the following way: 

1. Choose a spacelike 2-surface S that is orientable. 

2. At each point of S, choose a lightlike direction orthogonal to S that depends smoothly on 
the foot-point. (You have to choose between two possibilities.) 

3. Take all lightlike geodesies that are tangent to the chosen directions. These lightlike geodesies 
are called the generators of the wave front, and the wave front is the union of all generators. 

Clearly, a light cone is a special case of a wave front. One gets this special case by choosing for S 
an appropriate (small) sphere. Any wave front is the envelope of all light cones with vertices on 
the wave front. In this sense, general-relativistic wave fronts can be constructed according to the 
Huygens principle. 

In the context of general relativity the notion of wave fronts was introduced by Kermack, 
McCrea, and Whittaker [233j . For a modern review article see, e.g., Ehlers and Newman |119j . 

A coordinate representation for a wave front can be given with the help of (local) coordinates 
{u^,v?) on S. One chooses a parameter value sq and parametrizes each generator A affinely such 
that A(so) S S and A(so) depends smoothly on the foot-point in S. This gives the wave front as 
the image of a map 

(s,m\u2)i — >F\s,u^,u^), i = 0,1,2,3. (6) 

For light cones we may choose spherical coordinates, [u^ — . v?' — Q)^ (cf. Equation ([4]) with fixed 
r). Near s = so, map ([6]) is an embedding, i.e., the wave front is a submanifold. Orthogonality 
to S of the initial vectors A(so) assures that this submanifold is lightlike. Farther away from 5, 
however, the wave front need not be a submanifold. The caustic of the wave front is the set of 
all points where the map ^ is not an immersion, i.e., where its differential has rank < 3. As the 
derivative with respect to s is always non-zero, the rank can be 3 — 1 (caustic point of multiplicity 
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one, astigmatic focusing) or 3 — 2 (caustic point of multiplicity two, anastigmatic focusing). In 
the first case, the cross-section of an "infinitesimally thin" bundle of generators cohapses to a line, 
in the second case to a point (see Section 12.31) . For the case that the wave front is a light cone 
with vertex po, caustic points are said to be conjugate to po along the respective generator. For 
an arbitrary wave front, one says that a caustic point is conjugate to any spacelike 2-surface in 
the wave front. In this sense, the terms "conjugate point" and "caustic point" are synonymous. 
Along each generator, caustic points are isolated (see Section 12. 3|) and thus denumerable. Hence, 
one may speak of the first caustic, the second caustic, and so on. At all points where the caustic 
is a manifold, it is either spacelike or lightlike. For instance, the caustic of the Schwarzschild light 
cone in Figure [13] is a spacelike curve; in the spacetime of a transparent string, the caustic of the 
light cone consists of two lightlike 2-manifolds that meet in a spacelike curve (see Figure \26\i . 

Near a non-caustic point, a wave front is a hypersurface S = constant where S satisfies the 
Hamilton-Jacobi equation 

g'^{x)d.,S{x)djS{x) ^0. (7) 

In the terminology of optics. Equation ([T]) is called the eikonal equation. 

At caustic points, a wave front typically forms cuspidal edges or vertices whose geometry 
might be arbitrarily complicated, even locally. If one restricts to caustics which are stable against 
perturbations in a certain sense, then a local classification of caustics is possible with the help 
of Arnold's singularity theory of Lagrangian or Legendrian maps. Full details of this theory can 
be found in [14]. For a readable review of Arnold's results and its applications to wave fronts in 
general relativity, we refer again to |119| . In order to apply Arnold's theory to wave fronts, one 
associates each wave front with a Legendrian submanifold in the projective cotangent bundle over 
A4 (or with a Lagrangian submanifold in an appropriately reduced bundle). A caustic point of 
the wave front corresponds to a point where the differential of the projection from the Legendrian 
submanifold to Ai has non-maximal rank. For the case dim(A^) 4, which is of interest here, 
Arnold has shown that there are only five types of caustic points that are stable with respect 
to perturbations within the class of all Legendrian submanifolds. They are known as fold, cusp, 
swallow-tail, pyramid, and purse (see Figure [2]). Any other type of caustic is unstable in the sense 
that it changes non-diffeomorphically if it is perturbed within the class of Legendrian submanifolds. 

Fold singularities of a wave front form a lightlike 2-manifold in spacetime, on a sufficiently small 
neighborhood of any fold caustic point. The second picture in Figure [2] shows such a "fold surface" , 
projected to 3-space along the integral curves of a timelike vector field. This projected fold surface 
separates a region covered twice by the wave front from a region not covered at all. If the wave front 
is the past light cone of an observation event, and if one restricts to light sources with worldlines in 
a sufficiently small neighborhood of a fold caustic point, there are two images for light sources on 
one side and no images for light sources on the other side of the fold surface. Cusp singularities of 
a wave front form a spacelike curve in spacetime, again locally near any cusp caustic point. Such 
a curve is often called a "cusp ridge". Along a cusp ridge, two fold surfaces meet tangentially. 
The third picture in Figure [2] shows the situation projected to 3-space. Near a cusp singularity 
of a past light cone, there is local triple-imaging for light sources in the wedge between the two 
fold surfaces and local single-imaging for light sources outside this wedge. Swallow-tail, pyramid, 
and purse singularities are points where two or more cusp ridges meet with a common tangent, as 
illustrated by the last three pictures in Figure H] 

Friedrich and Stewart 149^ have demonstrated that all caustic types that are stable in the sense 
of Arnold can be realized by wave fronts in Minkowski spacetime. Moreover, they stated without 
proof that, quite generally, one gets the same stable caustic types if one allows for perturbations 
only within the class of wave fronts (rather than within the larger class of Legendrian submanifolds) . 
A proof of this statement was claimed to be given in |187J where the Lagrangian rather than the 
Legendrian formalism was used. However, the main result of this paper (Theorem 4.4 of [187] ) is 
actually too weak to justify this claim. A different version of the desired stability result was indeed 
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Figure 2: Wave fronts that are locally stable in the sense of Arnold. Each picture shows the 
projection into 3-space of a wave-front, locally near a caustic point. The projection is made along 
the integral curves of a timelike vector field. The qualitative features are independent of which 
timelike vector field is chosen. In addition to regular, i.e., non-caustic, points (^i), there are five 
kinds of stable points, known as fold (A2), cusp {A3), swallow-tail {A4), pyramid (D^), and purse 
(D^). The Ak and Dk notation refers to a relation to exceptional groups (see [2]). The picture 
is taken from |187] . 
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proven by another approach. In this approach one concentrates on an instantaneous wave front, i.e., 
on the intersection of a wave front with a spacehke hypersurface C. As an alternative terminology, 
one calls the intersection of a ("big") wave front with a hypersurface C that is transverse to all 
generators a "small wave front" . Instantaneous wave fronts are special cases of small wave fronts. 
The caustic of a small wave front is the set of all points where the small wave front fails to be an 
immersed 2-dimensional submanifold of C. If the spacetime is foliated by spacelike hypersurfaces, 
the caustic of a wave front is the union of the caustics of its small (= instantaneous) wave fronts. 
Such a foliation can always be achieved locally, and in several spacetimes of interest even globally. 
If one identifies different slices with the help of a timelike vector field, one can visualize a wave 
front, and in particular a light cone, as a motion of small (= instantaneous) wave fronts in 3-space. 
Examples are shown in Figures [TH [191 EQI EH and [29l Mathematically, the same can be done for 
non-spacelike slices as long as they are transverse to the generators of the considered wave front 
(see Figure [5T] for an example). Turning from (big) wave fronts to small wave fronts reduces the 
dimension by one. The only caustic points of a small wave front that are stable in the sense of 
Arnold are cusps and swallow-tails. What one wants to prove is that all other caustic points are 
unstable with respect to perturbations of the wave front within the class of wave fronts, keeping 
the metric and the slicing fixed. For spacelike slicings (i.e., for instantaneous wave fronts), this 
was indeed demonstrated by Low i264J . In this article, the author views wave fronts as subsets of 
the space M of all lightlike geodesies in {Ai,g). General properties of this space Af are derived in 
earlier articles by Low |262l 1263) (also see Penrose and Rindler |330) . volume II, where the space 
Af is treated in twistor language). Low considers, in particular, the case of a globally hyperbolic 
spacetime [264) : he demonstrates the desired stability result for the intersections of a (big) wave 
front with Cauchy hypersurfaces (see Section[32])- As every point in an arbitrary spacetime admits 
a globally hyperbolic neighborhood, this local stability result is universal. Figure HH] shows an 
instantaneous wave front with cusps and a swallow-tail point. Figure [T4l shows instantaneous wave 
fronts with caustic points that are neither cusps nor swallow-tails; hence, they must be unstable 
with respect to perturbations of the wave front within the class of wave fronts. 

It is to be emphasized that Low's work allows to classify the stable caustics of small wave fronts, 
but not directly of (big) wave fronts. Clearly, a (big) wave front is a one-parameter family of small 
wave fronts. A qualitative change of a small wave front, in dependence of a parameter, is called a 
"metamorphosis" in the English literature and a "perestroika" in the Russian literature. Combining 
Low's results with the theory of metamorphoses, or perestroikas, could lead to a classsification of 
the stable caustics of (big) wave fronts. However, this has not been worked out until now. 

Wave fronts in general relativity have been studied in a long series of articles by Newman, 
Frittelli, and collaborators. For some aspects of their work see Sections 12.91 and 13.41 In the quasi- 
Newtonian approximation formalism of lensing, the classification of caustics is treated in great 
detail in the book by Fetters, Levine, and Wambsganss 343 . Interesting related mateial can also 
be found in Blandford and Narayan [45) . For a nice exposition of caustics in ordinary optics see 
Berry and Upstill [37,. 

A light source that comes close to the caustic of the observer's past light cone is seen strongly 
magnified. For a point source whose worldline passes exactly through the caustic, the ray-optical 
treatment even gives an infinite brightness (see Section 12. 6p . If a light source passes behind a 
compact deflecting mass, its brightness increases and decreases in the course of time, with a 
maximum at the moment of closest approach to the caustic. Such microlensing events are routinely 
observed by monitoring a large number of stars in the bulge of our Galaxy, in the Magellanic Clouds, 
and in the Andromeda Galaxy (see, e.g., [2 80) for an overview). In his millennium essay on future 
perspectives of gravitational lensing, Blandford [44] mentioned the possibility of observing a chosen 
light source strongly magnified over a period of time with the help of a space-born telescope. The 
idea is to guide the spacecraft such that the worldline of the light source remains in (or close 
to) the one-parameter family of caustics of past light cones of the spacecraft over a period of 
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time. This futuristic idea of "caustic surfing" was mathematically further discussed by Frittelli 
and Fetters !159j. 

2.3 Optical scalars and Sachs equations 

For the calculation of distance measures, of image distortion, and of the brightness of images one 
has to study the Jacobi equation (= equation of geodesic deviation) along lightlike geodesies. This 
is usually done in terms of the optical scalars which were introduced by Sachs et al. |221[ 1360] . 
Related background material on lightlike geodesic congruences can be found in many text-books 
(see, e.g., Wald j425) . Section 9.2). In view of applications to lensing, a particularly useful exposi- 
tion was given by Seitz, Schneider and Ehlers |373] . In the following the basic notions and results 
will be summarized. 

Infinitesimally thin bundles. 

Let s I — > A(s) be an affinely parametrized lightlike geodesic with tangent vector field K — X. We 
assume that A is past-oriented, because in applications to lensing one usually considers rays from 
the observer to the source. We use the summation convention for capital indices A, B, . . . taking 
the values 1 and 2. An infinitesimally thin bundle (with elliptical cross-section) along A is a set 

B= {c'^Ya\c\c^ em, Sabc'^c^<1}- (8) 

Here 6ab denotes the Kronecker delta, and Yi and I2 are two vector fields along A with 

VKyxYA^ R{K,Ya,K), (9) 
g(K,YA) = 0, (10) 

such that Yi{s), Y2{s), and K(s) are linearly independent for almost all s. As usual, R denotes 
the curvature tensor, defined by 

RiX, y, Z) = \/xS/yZ - \/yS/xZ - S/[x,Y]Z. (11) 

Equation ^ is the Jacobi equation. It is a precise mathematical formulation of the statement that 
"the arrow-head of Ya traces an infinitesimally neighboring geodesic" . Equation (ITO|) guarantees 
that this neighboring geodesic is, again, lightlike and spatially related to A. Vector fields Ya that 
satisfy Equation ^ are known as Jacobi vector fields. 

Sachs basis. 

For discussing the geometry of infinitesimally thin bundles it is usual to introduce a Sachs basis, 
i.e., two vector fields Ei and E2 along A that are orthonormal, orthogonal to K = X, and parallelly 
transported, 

g{EA,EB)^5AB, g{K,EA)^0, VkEa^O. (12) 
Apart from the possibility to interchange them, Ei and E2 are unique up to transformations 

El — cosa El + s'ma E2 + aiK, (13) 
E2 = — sina El + cosa E2 + a2K, (14) 

where a, ai, and 02 are constant along A. A Sachs basis determines a unique vector field U with 
g{U,U) = —1 and g{U,K) = 1 along A that is perpendicular to Ei, and E2- As K is assumed 
past-oriented, U is future-oriented. In the rest system of the observer field U , the Sachs basis 
spans the 2-space perpendicular to the ray. It is helpful to interpret this 2-space as a "screen"; 
correspondingly, linear combinations of Ei and E2 are often refered to as "screen vectors" . 
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Jacobi matrix. 

With respect to a Sachs basis, the basis vector fields Yi and Y2 of an inffnitesimahy thin bundle 
can be represented as 

Ya = D^Eb + vaK. (15) 

The Jacobi matrix D — (D^) relates the shape of the cross-section of the infinitesimally thin 
bundle to the Sachs basis (see Figure [3]). Equation ^ implies that D satisfies the matrix Jacobi 
equation 

D ^ DR, (16) 
where an overdot means derivative with respect to the affine parameter s, and 

p_ f^oo \ /-Re(^o) Im(^o)\ 

^-{0 <!>ooJ^\ Im(Vo) Re{^o)J ^ ' 

is the optical tidal matrix, with 

$oo = -^Ric(if,i^), ^o = -lc{Ei-iE2,K,Ei-tE2,K). (18) 

Here Ric denotes the Ricci tensor, defined by Ric(X, Y) = tr (i?(-, X, Y)), and C denotes the con- 
formal curvature tensor (=Weyl tensor). The notation in Equation (jl8l) is chosen in agreement 
with the Newman-Penrose formalism (cf., e.g., 1 75] ) . As Yi, Y2, and K are not everywhere linearly 
dependent, det{D) does not vanish identically. Linearity of the matrix Jacobi equation implies 
that det(£>) has only isolated zeros. These are the "caustic points" of the bundle (see below). 

Shape parameters. 

The Jacobi matrix D can be parametrized according to 

/cosV' -sin?A\ ^D+ \ / cosx sinx\ 
ysimp cos^p / \ D^J sinx cosx) ' 

Here we made use of the well-known facts that any matrix can be written as the product of an 
orthogonal and a symmetric matrix and that any symmetric matrix can be diagonalized by an 
orthogonal transformation. Our definition of infinitesimally thin bundles implies that and I?_ 
are non-zero almost everywhere. In the representation of Equation (jl9p . the extremal points of the 
bundle's elliptical cross-section are given by the position vectors 

Y+ = cos -0 Yi + sin ^Y2~ D+ (cos x -Bi + sin x ^^2) , (20) 
yi = -sinV-Yi +cosi)Y2 ~ -D- (-sinx^^i -t-cosx-E'2), (21) 

where ~ means equality up to multiples of K. Hence, and \D^\ give the semi-axes of the 

elliptical cross-section and x gives the angle by which the ellipse is rotated with respect to the 
Sachs basis (see Figure [3]). We call D+, and x the shape parameters of the bundle. This name 
is taken from Frittelli, Kling, and Newman |152[ 1151] who actually use, instead of D+ and Z)_, 
the equivalent quantities D+D_ and For the case that the infinitesimally thin bundle 

can be embedded in a wave front, the shape parameters Z)+ and D- have the following interesting 
property (see Kantowski et al. [2221 1110] ). D+/D^ and D^/D^ give the principal curvatures 
of the wave front in the rest system of the observer field U which is perpendicular to the Sachs 
basis. The notation D+ and which is taken from |110] . is convenient because it often allows 
to write two equations in the form of one equation with a ± sign (see, e.g.. Equation (|27p or 
Equation (1^5]) below). The angle x can be directly linked with observations if a light source emits 
linearly polarized light (see Section [^3]) . 
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For any infinitesimally thin bundle, given in terms of Yi and I2, we can choose the Sachs 
basis as we Hke. This freedom leads to two ambiguities in the definition of _D+ and Z3_ . Firstly, 
the transformation {Ei,E2) (~Ei,E2) results in (_D+, x, V') {^D+, D^, —x,tp), and the 
analogous transformation {Ei,E2) i?2) results in — Z)-, — x, "0); 

this shows that the signs of and £)_ are ambiguous. Secondly, the transformation {Ei, E2) n- 
{E2,—Ei) results in D_, x, V') (^-j -0+, x, V' + '''■/2); this shows that and D_ can be 
interchanged. The most interesting case for us is that of an infinitesimally thin bundle that issues 
from a vertex at an observation event po = -^(0) into the past. For such bundles we can remove 
the sign ambiguity in the definition of £'-|-(s) and I?_(s) by requiring that they are positive for 
small positive values of s. The freedom of interchanging them can be removed, e.g., by requiring 
that D-^-{s) > D-{s) for small positive values of s; for spherically symmetric and static spacetimes, 
however, another convention is more convenient, see Section 14.31 below. If we have chosen a 
convention that makes -D+ and Z3_ unique along the bundle, the Sachs basis can still be changed 
by a transformation (jl31ll4p . Under such a transformation the shape parameters change according 
to D± = D±, X = X ~ 01^ tp = tp. This demonstrates the important fact that the shape and the 
size of the cross-section of an infinitesimally thin bundle have an invariant (observer-independent) 
meaning [360]. 




Figure 3: Cross-section of an infinitesimally thin bundle. The Jacobi matrix ()19|) relates the 
Jacobi fields Yi and I2 that span the bundle to the Sachs basis vectors Ei and £'2. The shape 
parameters Z?+, Z?_, and x determine the outline of the cross-section; the angle ip that appears 
in Equation (|19p does not show in the outline. The picture shows the projection into the 2-space 
("screen") spanned by Ei and E2] note that, in general, Yi and Y2 have components perpendicular 
to the screen. 
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Optical scalars. 

Along each infinitesimally thin bundle one defines the deformation matrix S by 



D = DS. (22) 



This reduces the second-order linear differential equation (|16p for D to a first-order non-linear 
diff'erential equation for S, 

S + SS = R. (23) 
It is usual to decompose S into antisymmetric, symmetric-tracefree, and trace parts, 

This defines the optical scalars lo {twist), 6 {expansion), and (ai, <T2) {shear). One usually combines 
them into two complex scalars g = 6 + iuj and a = ai + ia2 ■ A change (|13[ [Ti)) of the Sachs basis 
affects the optical scalars according to g = g and a = e~^*"iT. Thus, g and \a\ are invariant. If 
rewritten in terms of the optical scalars, Equation (j23p gives the Sachs equations 

^= -e'- |(Tp + $oo, (25) 
& = -a{g+g)+tl:o- (26) 

One sees that the Ricci curvature term $oo directly produces expansion (focusing) and that the 
conformal curvature term ipo directly produces shear. However, as the shear appears in Equa- 
tion (|25p. conformal curvature indirectly influences focusing (cf. Penrose [328] ). With D written in 
terms of the shape parameters and S written in terms of the optical scalars. Equation results 
in 

D±+ixD±-iijD^^ {p±e-^'^a)D±. (27) 

Along A, Equations ([25l [26]) give a system of 4 real first-order differential equations for the 4 real 
variables g and a; if g and a are known. Equation (f27ll gives a system of 4 real first-order differential 
equations for the 4 real variables D±, x, and ip. The twist-free solutions {g real) to Equations (^51 
[26)1 constitute a 3-dimensional linear subspace of the 4-dimensional space of all solutions. This 
subspace carries a natural metric of Lorentzian signature, unique up to a conformal factor, and 
was nicknamed Minikowski space in [26] . 



Conservation law. 

As the optical tidal matrix R is symmetric, for any two solutions Di and D2 of the matrix Jacobi 
equation (|16[) we have 

DiD^ ~ DiD^ = constant, (28) 

where ( )'^ means transposition. Evaluating the case Di = D2 shows that for every infinitesimally 
thin bundle 

LuD+D^ — constant. (29) 

Thus, there are two types of infinitesimally thin bundles: those for which this constant is non-zero 
and those for which it is zero. In the first case the bundle is twisting {uj ^ everywhere) and 
its cross-section nowhere collapses to a line or to a point ^ and £)_ 7^ everywhere). 

In the second case the bundle must be non- twisting (w = everywhere), because our definition 
of infinitesimally thin bundles implies that D+ 7^ and D_ 7^ almost everywhere. A quick 
calculation shows that cj = is exactly the integrability condition that makes sure that the 
infinitesimally thin bundle can be embedded in a wave front. (For the definition of wave fronts see 
Section [2.21 ) In other words, an infinitesimally thin bundle is twist-free if and only if we can find 
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a wave front such that A is one of the generators and the vector fields Yi and I2 connect A with 
infinitesimally neighboring generators. For a (necessarily twist-free) infinitesimally thin bundle, 
points where one of the two shape parameters -D+ and _D_ vanishes are called caustic points of 
multiplicity one, and points where both shape parameters Dj^ and _D_ vanish are called caustic 
points of multiplicity two. This notion coincides exactly with the notion of caustic points, or 
conjugate points, of wave fronts as introduced in Section 12.21 The behavior of the optical scalars 
near caustic points can be deduced from Equation (|27p with Equations (f25l [26]) . For a caustic 
point of multiplicty one at s = sq one finds 

e{s) = —^—{l + 0{s-so)), (30) 
2(s - So) 

\o{s)\ = —^{l + 0{s-s,)). (31) 
2(s - So) 

By contrast, for a caustic point of multiplicity two at s = so the equations read (cf. [373] ) 

e{s) ^ + 0{s - so), (32) 
s - So 

a{s) = ^Mso){s - 5o) + O ((s - so)') . (33) 
Infinitesimally thin bundles with vertex. 

We say that an infinitesimally thin bundle has a vertex at s = so if the Jacobi matrix satisfies 

r>(so) - 0, b[so) = 1. (34) 

A vertex is, in particular, a caustic point of multiplicity two. An infinitesimally thin bundle with a 
vertex must be non-twisting. While any non-twisting infinitesimally thin bundle can be embedded 
in a wave front, an infinitesimally thin bundle with a vertex can be embedded in a light cone. 
Near the vertex, to within a first-order approximation with respect to s — so, it has a circular 
cross-section. If Z?i has a vertex at si and D2 has a vertex at S2, the conservation law (|28p implies 

Z>^(si) ==-r>i(s2). (35) 

This is Etherington's |133| reciprocity law. The method by which this law was proven here follows 
Ellis jl27) (cf. Schneider, Ehlers, and Falco [367 ). Etherington's reciprocity law is of relevance, in 
particular in view of cosmology, because it relates the luminosity distance to the area distance (see 
Equation (|T7|) ). It was independently rediscovered in the 1960s by Sachs and Penrose (see [3281 
[2451 1. 

The results of this section are the basis for Sections 12. 4[ 12.51 and 12.61 



2.4 Distance measures 

In this section we summarize various distance measures that are defined in an arbitrary spacetime. 
Some of them are directly related to observable quantities with relevance for lensing. The material 
of this section makes use of the results on infinitesimally thin bundles which are summarized in 
Section 12.31 All of the distance measures to be discussed refer to a past-oriented lightlike geodesic 
A from an observation event po to an emission event ps (see Figure [4}. Some of them depend 
on the 4- velocity Uo of the observer at po and/or on the 4- velocity Us of the light source at ps- 
If a vector field U with g{U, U) — —1 is distinguished on A^, we can choose for the observer an 
integral curve of U and for the light sources all other integral curves of U . Then each of the dis- 
tance measures becomes a function of the observational coordinates (s, ^, 0, r) (recall Section [^?T|) . 
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Figure 4: Past-oriented lightlike geodesic A from an observation event po to an emission event pg. 
70 is the worldline of the observer, 73 is the worldline of the light source. Uq is the 4-velocity of 
the observer at po and Us is the 4-velocity of the light source at ps ■ 

Affine distance. 

There is a unique affine parametrization s 1 — >■ A(s) for each lightlike geodesic through the obser- 
vation event po such that A(0) — po and g ^A(O), Uq^ — 1- Then the affine parameter s itself can 
be viewed as a distance measure. This affine distance has the desirable features that it increases 
monotonously along each ray and that it coincides in an infinitesimal neighborhood of po with 
Euclidean distance in the rest system of Uq- The affine distance depends on the 4-velocity Uq of 
the observer but not on the 4-velocity Us of the light source. It is a mathematically very convenient 
notion, but it is not an observable. (It can be operationally realized in terms of an observer field 
whose 4- velocities are parallel along the ray. Then the affine distance results by integration if each 
observer measures the length of an infinitesimally short part of the ray in his rest system. However, 
in view of astronomical situations this is a purely theoretical construction.) The notion of affine 
distance was introduced by Kcrmack, McCrea, and Whittaker |233] . 

Travel time. 

As an alternative distance measure one can use the travel time. This requires the choice of a time 
function, i.e., of a function t that slices the spacetime into spacelike hypersurfaces t — constant. 
(Such a time function globally exists if and only if the spacetime is stably causal; see, e.g., jl93j . 
p. 198.) The travel time is equal to t{po) — t{ps), for each ps on the past light cone of po- In 
other words, the intersection of the light cone with a hypersurface t = constant determines events 
of equal travel time; we call these intersections "instantaneous wave fronts" (recall Section [2?2|) . 
Examples of instantaneous wave fronts are shown in Figures [HI \19\ [20l [28l and [29l The travel 
time increases monotonously along each ray. Clearly, it depends neither on the 4-velocity Uq of 
the observer nor on the 4-velocity Us of the light source. Note that the travel time has a unique 
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value at each point of po's past light cone, even at events that can be reached by two different 
rays from po- Near po the travel time coincides with Euclidean distance in the observer's rest 
system only if Uq is perpendicular to the hypersurface t = constant with dtiUo) = 1. (The lat- 
ter equation is true if along the observer's world line the time function t coincides with proper 
time.) The travel time is not directly observable. However, travel time differences are observable 
in multiple-imaging situations if the intrinsic luminosity of the light source is time-dependent. To 
illustrate this, think of a light source that flashes at a particular instant. If the flash reaches the 
observer's wordline along two different rays, the proper time difference Atq of the two arrival 
events is directly measurable. For a time function t that along the observer's worldline coincides 
with proper time, this observed time delay Aro gives the difference in travel time for the two rays. 
In view of applications, the measurement of time delays is of great relevance for quasar lensing. For 
the double quasar 0957+561 the observed time delay Atq is about 417 days (see, e.g., [343], p. 149). 

Redshift. 

In cosmology it is common to use the redshift as a distance measure. For assigning a redshift to 
a lightlike geodesic A that connects the observation event po on the worldline 70 of the observer 
with the emission event ps on the worldline 7s of the light source, one considers a neighboring 
lightlike geodesic that meets 70 at a proper time interval Atq from po and 73 at a proper time 
interval Ars from ps ■ The redshift z is defined as 

z — lim — ^. (36) 

Ars-i-O Ars 

If A is affinely parametrized with A(0) — po and A(s) — ps, one finds that z is given by 



g(A(o),c/o) 

l + z = ^^ (37) 
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{Hs),Us 



This general redshift formula is due to Kermack, McCrea, and Whittaker [233 . Their proof 
is based on the fact that g{\, Y) is a constant for all Jacobi fields Y that connect A with an 
infinitesimally neighboring lightlike geodesic. The same proof can be found, in a more elegant 
form, in |i59^ and in [389] . p. 109. An alternative proof, based on variational methods, was given 
by Schrodinger [368J . Equation (j37l) is in agreement with the Hamilton formalism for photons. 
Clearly, the redshift depends on the 4- velocity C/q of the observer and on the 4- velocity Us of the 
light source. If a vector field U with g{U, U) = —I has been distinguished on A4, we may choose 
one integral curve of U as the observer and all other integral curves of U as the light sources. 
Then the redshift becomes a function of the observational coordinates (s, '5, 8, r). For s — >■ 0, the 
redshift goes to 0, 

z{s,^,e,T) = h{-i',e,T)s + 0{s^), (38) 

with a (generalized) Hubble parameter /i(^,0,t) that depends on spatial direction and on time. 
For criteria that h and the higher-order coefficients are independent of ^I' and 9 see |190) . If the 
redshift is known for one observer field [/, it can be calculated for any other ?7, according to Equa- 
tion (I57|) . just by adding the usual special-relativistic Doppler factors. Note that if Uq is given, 
the redshift can be made to zero along any one ray A from po by choosing the 4-velocities J7a(s) 
appropriately. This shows that z is a reasonable distance measure only for special situations, e.g., 
in cosmological models with U denoting the mean flow of luminous matter ("Hubble flow"). In 
any case, the redshift is directly observable if the light source emits identifiable spectral lines. For 
the calculation of Sagnac-like effects, the redshift formula ([37]) can be evaluated piecewise along 
broken lightlike geodesies [^ . 
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Angular diameter distances. 

The notion of angular diameter distance is based on the intuitive idea that the farther an object 
is away the smaher it looks, according to the rule 

object diameter = angle x distance. (39) 

The formal definition needs the results of Section r2.3l on infinitesimally thin bundles. One considers 
a past-oriented lightlike geodesic s — >■ A(s) parametrized by affine distance, i.e., A(0) — po and 
g(^X{0),Uo) = 1, and along A an infinitesimally thin bundle with vertex at the observer, i.e., at 
s — 0. Then the shape parameters D+{s) and D_{s) (recall Figure[3]) satisfy the initial conditions 
-D±(0) = and D±{0) = 1. They have the following physical meaning. If the observer sees a cir- 
cular image of (small) angular diameter a on his or her sky, the (small but extended) light source 
at afhne distance s actually has an elliptical cross-section with extremal diameters a|D±(s)|. It is 
therefore reasonable to call Z?+ and D- the extremal angular diameter distances. Near the vertex, 
Z)+ and D_ are monotonously increasing functions of the afhne distance, D±{s) = s + O(s^). 
Farther away from the vertex, however, they may become decreasing, so the functions s i-^ -D+(s) 
and s I— > D_ (s) need not be invertible. At a caustic point of multiplicity one, one of the two 
functions and I?_ changes sign; at a caustic point of multiplicity two, both change sign (re- 
call Section l2.3p . The image of a light source at affine distance s is said to have even parity if 
D_^.{s)D_{s) > and odd parity if D+{s)D_{s) < 0. Images with odd parity show the neigh- 
borhood of the light source side-inverted in comparison to images with even parity. Clearly, _D_|_ 
and Z3_ are reasonable distance measures only in a neighborhood of the vertex where they are 
monotonously increasing. However, the physical relevance of Z?+ and D- lies in the fact that they 
relate cross-sectional diameters at the source to angular diameters at the observer, and this is 
always true, even beyond caustic points. and £)_ depend on the 4-velocity Uq of the observer 
but not on the 4-velocity Us of the source. This reflects the fact that the angular diameter of an 
image on the observer's sky is subject to aberration whereas the cross-sectional diameter of an 
infinitesimally thin bundle has an invariant meaning (recall Section [^75]) . Hence, if the observer's 
worldline 70 has been specified, ZJ^ and I?_ are well-defined functions of the observational coor- 
dinates (s, ^E", 0, r). 



Area distance. 

The area distance -Daroa is defined according to the idea 

object area = solid angle x distance^. (40) 

As a formal definition for Darca, in terms of the extremal angular diameter distances and _D_ 
as functions of affine distance s, we use the equation 

i5arca(s) = ^\D+{s) D-{s)\. (41) 

-Darea(s)^ indeed relates, for a bundle with vertex at the observer, the cross-sectional area at the 
source to the opening solid angle at the observer. Such a bundle has a caustic point exactly at 
those points where -Darea(s) = 0. The area distance is often called "angular diameter distance" 
although, as indicated by Equation the name "averaged angular diameter distance" would 

be more appropriate. Just as D+ and £>_ , the area distance depends on the 4-velocity Uq of the 
observer but not on the 4-velocity Us of the light source. The area distance is observable for a 
light source whose true size is known (or can be reasonably estimated). It is sometimes convenient 
to introduce the magnification or amplification factor 

= n r Tn ( ^ ■ (^2) 
D+{s)D^{s) 
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The absolute value of /i determines the area distance, and the sign of /i determines the parity. 
In Minkowski spacetime, D±(s) = s and, thus, /i(s) = 1. Hence, > 1 means that a (small 

but extended) light source at affine distance s subtends a larger solid angle on the observer's sky 
than a light source of the same size at the same affine distance in Minkowski spacetime. Note that 
in a multiple-imaging situation the individual images may have different affine distances. Thus, 
the relative magnification factor of two images is not directly observable. This is an important 
difference to the magnification factor that is used in the quasi-Newtonian approximation formalism 
of lensing. The latter is defined by comparison with an "unlensed image" (see, e.g., [3671 ), a notion 
that makes sense only if the metric is viewed as a perturbation of some "background" metric. One 
can derive a differential equation for the area distance (or, equivalently, for the magnification 
factor) as a function of affine distance in the following way. On every parameter interval where 
D+D- has no zeros, the real part of Equation ([27]) shows that the area distance is related to the 
expansion by 

-Darca = ^'-Darca- (43) 

Insertion into the Sachs equation (|25|) for 9 = g gives the focusing equation 

i^arca = " + \ Ric(A, A)^ Z^aroa- (44) 

Between the vertex at s = and the first conjugate point (caustic point), -Daroa is determined by 
Equation (|44p and the initial conditions 

i?arca(0) = 0, I)arca(O) = 1. (45) 

The Ricci term in Equation (|44l) is non-negative if Einstein's field equation holds and if the energy 
density is non- negative for all observers ( "weak energy condition" ) . Then Equations (|441 H5|) imply 
that 

i?arca(s) < S, (46) 

i.e., 1 < ^(s), for all s between the vertex at s = and the first conjugate point. In Minkowski 
spacetime, the equation £'aica(s) = s holds. Hence, the inequality says that a gravitational 
field has a focusing, as opposed to a defocusing, effect. This is sometimes called the focusing the- 
orem. 



Corrected luminosity distance. 

The idea of defining distance measures in terms of bundle cross-sections dates back to Tolman |404j 
and Whittaker [435) . Originally, this idea was applied not to bundles with vertex at the observer 
but rather to bundles with vertex at the light source. The resulting analogue of the area distance 
is the so-called corrected luminosity distance D[^^. It relates, for a bundle with vertex at the 
light source, the cross-sectional area at the observer to the opening solid angle at the light source. 
Owing to Etherington's reciprocity law p5p . area distance and corrected luminosity distance are 
related by 

^I'um = (1 + ^)^aroa. (47) 

The redshift factor has its origin in the fact that the definition of D[^^ refers to an affine parametriza- 
tion adapted to Us, and the definition of Carea refers to an affine parametrization adapted to Uq- 
While Darca dcpcnds on J7o but not on Us, -Dfum depends on Us but not on Uq- 

Luminosity distance. 

The physical meaning of the corrected luminosity distance is most easily understood in the photon 
picture. For photons isotropically emitted from a light source, the percentage that hit a prescribed 
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area at the observer is proportional to 1/{D[^^'^. As the energy of each photon undergoes a 
redshift, the energy flux at the observer is proportional to l/(I?ium)^, where 

-Dlum = (1 + ^) A'um = (1 + ^)'^arca- (48) 

Thus, -Dium is the relevant quantity for calculating the luminosity (apparent brightness) of point- 
like light sources (see Equation (|52|) '). For this reason Dium is called the (uncorrected) luminosity 
distance. The observation that the purely geometric quantity D[^^ must be modified by an addi- 
tional redshift factor to give the energy flux is due to Walker |426] . Dium depends on the 4- velocity 
Uo of the observer and of the 4-velocity C/s of the light source. Dium and D[^^ can be viewed as 
functions of the observational coordinates (s, 5',9,r) if a vector field U with g{U,U) = — 1 has 
been distinguished, one integral curve of U is chosen as the observer, and the other integral curves 
of U are chosen as the light sources. In that case Equation (|38|) implies that not only i?area(s) but 
also Diujn{s) and D[^^{s) are of the form s + 0{s'^). Thus, near the observer aU three distance 
measures coincide with Euclidean distance in the observer's rest space. 

Parallax distance. 

In an arbitrary spacetime, we fix an observation event po and the observer's 4-velocity Uq. We 
consider a past-oriented lightlike geodesic A parametrized by affine distance, A(0) = po and 

g^A(0),[/oj =1- To a light source passing through the event A(s) we assign the (averaged) 

parallax distance £'par(s) = —0{O)~^, where 9 is the expansion of an infinitesimally thin bundle 
with vertex at A(s). This definition follows |221j . Its relevance in view of cosmology was discussed 
in detail by Rosquist |357| . Dpar can be measured by performing the standard trigonometric par- 
allax method of elementary Euclidean geometry, with the observer at po and an assistant observer 
at the perimeter of the bundle, and then averaging over all possible positions of the assistant. Note 
that the method refers to a bundle with vertex at the light source, i.e., to light rays that leave the 
light source simultaneously. (Averaging is not necessary if this bundle is circular.) Dpar depends 
on the 4-velocity of the observer but not on the 4-velocity of the light source. To within first-order 
approximation near the observer it coincides with affine distance (recall Equation p2p ). For the 
potential obervational relevance of Dpar see [357] . and [367) . p. 509. 

In view of lensing, , D- , and -Dium are the most important distance measures because they 
are related to image distortion (see Section [2751) and to the brightness of images (see Section [276]) . 
In spacetimes with many symmetries, these quantities can be explicitly calculated (see Section [4. II 
for conformally fiat spacetimes, and Section 23] for spherically symmetric static spacetimes). This 
is impossible in a spacetime without symmetries, in particular in a realistic cosmological model 
with inhomogeneities ("clumpy universe"). Following Kristian and Sachs [243] , one often uses 
series expansions with respect to s. For statistical considerations one may work with the focusing 
equation in a Friedmann-Robertson- Walker spacetime with average density (see Section [47T]) . or 
with a heuristically modified focusing equation taking clumps into account. The latter leads to 
the so-called Dyer-Roeder distance [1121 1113] which is discussed in several text-books (see, e.g., 
[367]). (For pre-Dyer-Roeder papers on optics in cosmological models with inhomogeneities, see the 
historical notes in [223] .) As overdensities have a focusing and underdensities have a defocusing 
effect, it is widely believed (following 1428] ) that after averaging over sufficiently large angular 
scales the Friedmann-Robertson- Walker calculation gives the correct distance-redshift relation. 
However, it was argued by Ellis, Bassett, and Dunsby [129] that caustics produced by the lensing 
effect of overdensities lead to a systematic bias towards smaller angular sizes ( "shrinking" ) . For a 
spherically symmetric inhomogeneity, the effect on the distance-redshift relation can be calculated 
analytically [291] . For thorough discussions of light propagation in a clumpy universe also see Pyne 
and Birkinshaw [35^, and Holz and Wald [203] . 
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2.5 Image distortion 

In special relativity, a spherical object always shows a circular outline on the observer's sky, in- 
dependent of its state of motion |325l I402J . In general relativity, this is no longer true; a small 
sphere usually shows an elliptic outline on the observer's sky. This distortion is caused by the 
shearing effect of the spacetime geometry on light bundles. For the calculation of image distortion 
we need the material of Sections 12.31 and 12.41 For an observer with 4- velocity Uq at an event pq , 
there is a unique afBne parametrization s i — > A(s) for each lightlike geodesic through po such 
that A(0) = Po and (;(A(0), Uq) = 1. Around each of these A we can consider an infinitesimally 
thin bundle with vertex at s = 0. The elliptical cross-section of this bundle can be characterized 
by the shape parameters _D+(s), -D-(s) and (recall Figure |3|). As outlined in Section [23| we 
choose the convention of having Z?+(s) and D_(s) positive for small positive s. In the terminology 
of Section \TM s is the afHne distance, and D+{s) and -D-(s) are the extremal angular diameter 
distances. The complex quantity 

is called the ellipticity of the bundle. The phase of e determines the position angle of the elliptical 
cross-section of the bundle with respect to the Sachs basis. The absolute value of e(s) determines 
the eccentricity of this cross-section; e(s) — indicates a circular cross-section and |e(s)| = oo 
indicates a caustic point of multiplicity one. (It is also common to use other measures for the 
eccentricity, e.g., |D+ — D-\/\D+ + -D_|.) From Equation (|27p with g = 6 we get the derivative of 
e with respect to the afhne distance s. 



e = 2aV|eP-H4. (50) 

The initial conditions D±{0) = 0, D±{0) = 1 imply 

e(0)=0. (51) 

Equation (|50p and Equation (I51|) determine e if the shear a is known. The shear, in turn, is 
determined by the Sachs equations (l?51 [^5]) and the initial conditions (|32[ [551) with sq — for 
0{— g) and a. 

It is recommendable to change from the e determined this way to e = — e. This transformation 
corresponds to replacing the Jacobi matrix D by its inverse. The original quantity e(s) gives the 
true shape of objects at afhne distance s that show a circular image on the observer's sky. The 
new quantity e{s) gives the observed shape for objects at afline distance s that actually have a 
circular cross-section. In other words, if a (small) spherical body at afhne distance s is observed, 
the ellipticity of its image on the observer's sky is given by e(s). 

By Equations ((50l ISTj) . e vanishes along the entire ray if and only if the shear a vanishes 
along the entire ray. By Equations (|26[ the shear vanishes along the entire ray if and only 
if the conformal curvature term vanishes along the entire ray. The latter condition means 
that if = A is tangent to a principal null direction of the conformal curvature tensor (see, e.g., 
Chandrasekhar (75jV At a point where the conformal curvature tensor is not zero, there are at 
most four different principal null directions. Hence, the distortion effect vanishes along all light 
rays if and only if the conformal curvature vanishes everywhere, i.e., if and only if the spacetime 
is conformally flat. This result is due to Sachs [360]. An alternative proof, based on expressions 
for image distortions in terms of the exponential map, was given by Hasse |186| . 

For any observer, the distortion measure e — — e is defined along every light ray from every point 
of the observer's worldline. This gives e as a function of the observational coordinates (s, ^E", 8, r) 
(recall Section [2?T1 in particular Equation ([4])). If we fix r and s, e is a function on the observer's 
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sky. (Instead of s, one may choose any of the distance measures discussed in Section [231 provided 
it is a unique function of s.) In spacetimes with sufficiently many symmetries, this function can be 
explicitly determined in terms of integrals over the metric function. This will be worked out for 
spherically symmetric static spacetimes in Section [4.3l A general consideration of image distortion 
and example calculations can also be found in papers by Frittelli, Kling and Newman [1521 1151j . 
Frittelli and Oberst [158] calculate image distortion by a "thick gravitational lens" model within 
a spacetime setting. 

In cases where it is not possible to determine e by explicitly integrating the relevant differential 
equations, one may consider series expansions with respect to the afhne parameter s. This tech- 
nique, which is of particular relevance in view of cosmology, dates back to Kristian and Sachs |243j 
who introduced image distortion as an observable in cosmology. In lowest non-vanishing order, 
e(s, 0, To) is quadratic with respect to s and completely determined by the conformal curvature 
tensor at the observation event po = 7(to), as can be read from Equations ([501 W\\ [331) • One 
can classify all possible distortion patterns on the observer's sky in terms of the Petrov type of 
the Weyl tensor [78]. As outlined in [78], these patterns are closely related to what Penrose and 
Rindler [330] call the fingerprint of the Weyl tensor. At all observation events where the Weyl 
tensor is non-zero, the following is true. There are at most four points on the observer's sky where 
the distortion vanishes, corresponding to the four (not necessarily distinct) principal null directions 
of the Weyl tensor. For type N, where all four principal null directions coincide, the distortion 
pattern is shown in Figure [5] 
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Figure 5: Distortion pattern. The picture shows, in a Mercator projection with $ as the horizontal 
and as the vertical coordinate, the celestial sphere of an observer at a spacetime point where the 
Weyl tensor is of Petrov type N . The pattern indicates the elliptical images of spherical objects 
to within lowest non-trivial order with respect to distance. The length of each line segment is a 
measure for the eccentricity of the elliptical image, the direction of the line segment indicates its 
major axis. The distortion effect vanishes at the north pole = which corresponds to the fourfold 
principal null direction. Contrary to the other Petrov types, for type N the pattern is universal up 
to an overall scaling factor. The picture is taken from [7S] where the distortion patterns for the 
other Petrov types are given as well. 

The distortion effect is routinely observed since the mid-1980s in the form of arcs and (radio) 
rings (see |367[I343|I427] for an overview). In these cases a distant galaxy appears strongly elongated 
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in one direction. Such strong elongations occur near a caustic point of multiplicity one where 
|e| — > oo. In the case of rings and (long) arcs, the entire bundle cannot be treated as infinitesimally 
thin, i.e., a theoretical description of the effect requires an integration. For the idealized case of a 
point source, images in the form of (1-dimensional) rings on the observer's sky occur in cases of 
rotational symmetry and are usually called "Einstein rings" (see Section |473|) . The rings that are 
actually observed show extended sources in situations close to rotational symmetry. 

For the majority of galaxies that are not distorted into arcs or rings, there is a "weak lensing" 
effect on the apparent shape that can be investigated statistically. The method is based on the 
assumption that there is no prefered direction in the universe, i.e., that the axes of (approximately 
spheroidal) galaxies are randomly distributed. So, without a distortion effect, the axes of galaxy 
images should make a randomly distributed angle with the (^, Q) grid on the observer's sky. Any 
deviation from a random distribution is to be attributed to a distortion effect, produced by the 
gravitational field of intervening masses. With the help of the quasi-Newtonian approximation, 
this method has been elaborated into a sophisticated formalism for determining mass distributions, 
projected onto the plane perpendicular to the line of sight, from observed image distortions. This 
is one of the most important astrophysical tools for detecting (dark) matter. It has been used 
to determine the mass distribution in galaxies and galaxy clusters, and to probe the large-scale 
structure of the universe (see [351 1206] for reviews). From a methodological point of view, it would 
be desirable to analyse this important line of astronomical research within a spacetime setting. 
This should give prominence to the role of the conformal curvature tensor. 

Another interesting way of observing weak image distortions is possible for sources that emit 
linearly polarized radiation. This is true for many radio galaxies. (Polarization measurements 
are also relevant for strong-lensing situations; see Schneider, Ehlers, and Falco |367j . p. 82 for 
an example.) The method is based on the geometric optics approximation of Maxwell's theory. 
In this approximation, the polarization direction is parallel along each ray between source and 
observer [114] (cf., e.g., |279J . p. 577). We may, thus, choose the Sachs basis {Ei,E2) such that 
the plane spanned by K and Ei gives the polarization direction. This fixes the Sachs basis up to 
transformations (|14|) with a = 0, i.e., it gives an unambiguous (observer-independent) meaning to 
the angle x in Figure [31 If a light source (e.g., a galaxy) shows an approximately elliptic shape 
on the observer's sky, it is reasonable to assume that at the light source the polarization direction 
is aligned with one of the axes, i.e., 2x{s)/tt S Z. A distortion effect is verified if the observed 
polarization direction is not aligned with an axis of the image, 2x(0)/7r <^ Z. It is to be emphasized 
that such a change of the angle x along the ray cannot be the result of a rotation; the bundles 
under consideration have a vertex and are, thus, twist-free. It can only be the result of successive 
shearing processes, governed by the behaviour of the conformal curvature tensor along the ray. 
Also, the effect has nothing to do with the rotation of an observer field; we have already stressed 
that the angle x is observer- independent. Related misunderstandings have been clarified by Panov 
and Sbytov |322[I323] . So far, this distortion effect has not been observed. (Panov and Sbytov |322j 
have clearly shown that an anisotropy observed by Birch [42] , even if real, cannot be interpreted 
in this way.) Its future detectability is estimated, for distant radio sources, in 396 . 

The effect of a gravitatational field on the polarization direction of light was first discussed by 
Skrotskii [383' in 1957 and is therefore sometimes called the "Skrotskii effect" . If the spacetime is 
conformally stationary, and if the worldlines of observers and light sources are integral curves of 
the conformal Killing vector field, the effect can be expressed in terms of the "Fermat geometry" of 
3-space [189j . see Section below for the definition of the Fermat geometry. (Note that Figure 1 
in [1891 is erroneous because it ignores the fact that, in general, the principal shear directions of a 
bundle are not parallel along the central ray.) Relative to a frame that is parallel with respect to the 
Fermat metric, one finds a rotation of the polarization direction that is analogous to the well-known 
Faraday rotation in a magnetic field. In this analogy, the magnetic field corresponds to the rotation 
(twist) of the conformal Killing vector field. Because of this analogy, the Skrotskii effect is also 
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known as the "gravitational Faraday effect" . It has been quite extensively discussed for stationary 
spacetimes and, in particular, for the Kerr metric (see, e.g., [TtSI [5551 [Ti^ [^TT] I5TU1 157i] ) . All these 
articles give formulas for the rotation of the polarization direction relative to a frame distinguished 
by the symmetry assumptions. This rotation should not be confused with the above-mentioned 
motion of the polarization direction relative to the orientation of the image. The latter is a 
distortion effect, governed by the conformal curvature tensor; the former is a gravitomagnetic 
effect, governed by the rotation of a distinguished observer field. 

2.6 Brightness of images 

For calculating the brightness of images we need the definitions and results of Section 12.41 In 
particular we need the luminosity distance I?ium and its relation to other distance measures. We 
begin by considering a point source (worldline) that emits isotropically with (bolometric, i.e., 
integrated over all frequencies) luminosity L. By definition of D\um, in this case the energy flux at 
the observer is 

i*" is a measure for the brightness of the image on the observer's sky. The magnitude m used by 
astronomers is essentially the negative logarithm of F, 

m = 2.51ogio (Aum^) - 2.b\Qg^Q{L) +too, (53) 

with mo being a universal constant. In Equation (|52l) . Z^ium can be expresed in terms of the area 
distance /?area and the redshift z with the help of the general relation (|48)) . This demonstrates that 
the magnification factor /i, which is defined by Equation (H^ . admits the following reinterpretation. 
|/i(s)| relates the flux from a point source at affine distance s to the flux from a point source with 
the same luminosity at the same affine distance and at the same redshift in Minkowski spacetime. 

D\un\ can be explicitly calculated in spacetimes where the Jacobi fields along lightlike geodesies 
can be explicitly determined. This is true, e.g., in spherically symmetric and static spacetimes 
where the extremal angular diameter distances and D_ can be calculated in terms of integrals 
over the metric coefficients. The resulting formulas are given in Section 14.31 below. Knowledge 
of and immediately gives the area distance i^arca via Equation (|4ip . -Darca together with 
the redshift determines i^ium via Equation (j48p . Such an explicit calculation is, of course, possible 
only for spacetimes with many symmetries. 

By Equation (|48[) . the zeros of Dium coincide with the zeros of -Darca, i-C, with the caustic points. 
Hence, in the ray-optical treatment a point source is infinitely bright (magnitude m = — oo) if it 
passes through the caustic of the observer's past light cone. A wave-optical treatment shows 
that the energy flux at the observer is actually bounded by diffraction. In the quasi-Newtonian 
approximation formalism, this was demonstrated by an explicit calculation for light rays deflected 
by a spheroidal mass by Ohanian [313] (cf. [367], p. 220). Quite generally, the ray-optical calculation 
of the energy flux gives incorrect results if, for two different light paths from the source worldline 
to the observation event, the time delay is smaller than or approximately equal to the coherence 
time. Then interference effects give rise to frequency-dependent corrections to the energy flux that 
have to be calculated with the help of wave optics. In multiple-imaging situations, the time delay 
decreases with decreasing mass of the deflector. If the deflector is a cluster of galaxies, a galaxy, 
or a star, interference effects can be ignored. Gould [181J suggested that they could be observable 
if a deflector of about 10""'^^ Solar masses happens to be close to the line of sight to a gamma-ray 
burster. In this case, the angle-separation between the (unresolvable) images would be of the order 
10~^^ arcseconds ("femtolensing"). Interference effects could make a frequency-dependent imprint 
on the total intensity. Ulmer and Goodman [409] discussed related effects for deffectors of up to 
10~^^ Solar masses. Femtolensing has not been observed so far. However, it is an interesting 
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future perspective for lensing effects where wave optics has to be taken into account. This would 
give practical relevance to the theoretical work of Herlt and Stephani |196l 1197] who calculated 
gravitational lensing on the basis of wave optics in the Schwarzschild spacetime. Wave-optical 
aspects of gravitational lensing are also discussed in [294] . 

We now turn to the case of an extended source, whose surface makes up a S-dimensional 
timelike submanifold T of the spacetime. In this case the radiation is characterized by the surface 
brightness B (= luminosity L per area) at the source and by the intensity I (= energy flux F per 
solid angle) at the observer. For each past-oriented light ray from an observation event po and to 
an event ps on T, we can relate B and / in the following way. By definition, the area distance 
Danes, relates the area at the source to the solid angle at the observer, so we get from Equation ([52|) 
I = BDarea^/ (47rAum^). As area distance and luminosity distance are related by a redshift factor, 
according to the general law (|48p . this gives the relation 

1= — r-^ — TT- (54) 
47r(l-|-z)4 ^ 

This result is, of course, valid only if the radiation from different parts of the emitting surface 
is incoherent; otherwise interference effects have to be taken into account. The most remarkable 
feature of Equation (|54p is that all distance measures have dropped out. Save for a redshift factor, 
the (observed) intensity of a radiating surface is the same for all observers. 

The law for point sources (15^ and the law for extended sources (|54p refer to bolometric quan- 
tities, i.e., to integration over all frequencies. As every astronomical observation is restricted to 
a certain frequency range, it is actually necessary to consider frequency-specific quantities. For a 
point source, one writes L = £{u}s)du}s and F = f{uJo)du}o, where the specific luminosity 
£ is a function of the emitted frequency los and the specific flux / is a function of the received 
frequency wq- As and ujq are related by a redshift factor, the frequency-specific version of 
Equation ((5^ reads 

£iujo{l + z)){l + z) 
fi^o) = -r-f^ 2 • (55 

Similarly, for an extended source one introduces a specific surface brightness b and a specific 
intensity i such that B = b{uJs)duJs and I = Jq i{^o)dLdo- Then one gets the following 
frequency-specific version of Equation (|54p . 



b{uJo{l + z)) 

^^"°^= 441 + .)^ - ^'^^ 

The results summarized in this section can also be derived from the kinetic theory of photons 
(see, e.g., |115j ). In the photon picture, the three redshift factors in Equation ([55]) are easily 
understood: The first refiects the fact that each photon undergoes a redshift; the second relates 
the rate of emission (with respect to proper time at the source) to the rate of reception (with 
respect to proper time at the obsever); the third refiects the aberration effect on the angular size 
of the source in dependence of the motion of the observer. 

As an example for the calculation of the brightness of images we consider the Schwarzschild 
spactime (see Figure [T8|). 



2.7 Conjugate points and cut points 

In general, the past light cone of an event forms caustics and transverse self-intersections, i.e., 
it is neither an embedded nor an immersed submanifold. The relevance of this fact in view of 
lensing was emphasized already in Section 12.11 In the following we demonstrate that caustics 
and transverse self-intersections of the light cone are related to extremizing properties of lightlike 
geodesies. A light cone with a caustic and a transverse self-intersection is shown in Figure [26] 
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In this section and in Section 12.81 we use mathematical techniques which are related to the 
Penrose-Hawking singularity theorems. For background material, see Penrose [329) . Hawking and 
Ellis [Hg, O'Neill [Sm, and Wald pS] . 

Recall from Section 12.21 that the caustic of the past light cone of po is the set of all points 
where this light cone is not an immersed submanifold. A point ps is in the caustic if a generator 
A of the light cone intersects at ps an infinitesimally neighboring generator. In this situation ps 
is said to be conjugate to po along A. The caustic of the past light cone of po is also called the 
"past lightlike conjugate locus" of po. 

The notion of conjugate points is related to the extremizing properties of lightlike geodesies 
in the following way. Let A be a past-oriented lightlike geodesic with A(0) = po- Assume that 
PS — A(so) is the first conjugate point along this geodesic. This means that ps is in the caustic of 
the past light cone of po and that A does not meet the caustic at parameter values between and 
sq. Then a well-known theorem says that all points A(s) with < s < sq cannot be reached from 
Po along a timelike curve arbitrarily close to A, and all points A(s) with s > sq can. For a proof 
we refer to Hawking and Ellis [193) . Proposition 4.5.11 and Proposition 4.5.12. It might be helpful 
to consult O'Neill [315) . Chapter 10, Proposition 48, in addition. 

Here we have considered a past-oriented lightlike geodesic because this is the situation with 
relevance to lensing. Actually, Hawking and Ellis consider the time-reversed situation, i.e., with 
A future-oriented. Then the result can be phrased in the following way. A material particle may 
catch up with a light ray A after the latter has passed through a conjugate point and, for particles 
staying close to A, this is impossible otherwise. The restriction to particles staying close to A is 
essential. Particles "taking a short cut" may very well catch up with a lightlike geodesic even if 
the latter is free of conjugate points. 

For a discussion of the extremizing property in the global sense, not restricted to timelike curves 
close to A, we need the notion of cut points. The precise definition of cut points reads as follows. 

As ususal, let I~{po) denote the chronological past of po, i-e., the set of all g G that can 
be reached from po along a past-pointing timelike curve. In Minkowski spacetime, the boundary 
dI~{po) of I~{po) is just the past light cone of po united with {po}- In an arbitrary spacetime, 
this is not true. A lightlike geodesic A that issues from po into the past is always confined to 
the closure of I~{po), but it need not stay on the boundary. The last point on A that is on the 
boundary is by definition j66| the cut point of A. In other words, it is exactly the part of A beyond 
the cut point that can be reached from po along a timelike curve. The union of all cut points, 
along any past-pointing lightlike geodesic A from po , is called the cut locus of the past light cone 
(or the past lightlike cut locus of po)- For the light cone in Figure [51] this is the curve (actually 
2-dimensional) where the two sheets of the light cone intersect. For the light cone in Figure [^H] 
the cut locus is the same set plus the swallow-tail point (actually 1-dimensional). For a detailed 
discussion of cut points in manifolds with metrics of Lorentzian signature, see |32j . For positive 
definite metrics, the notion of cut points dates back to Poincare |349| and Whitehead [434]. 

For a generator A of the past light cone of po , the cut point of A does not exist in either of the 
two following cases: 

1. A always stays on the boundary 9/~(po)i i-C-j it never loses its extremizing property. 

2. A is always in I^{po), i.e., it fails to be extremizing from the very beginning. 

Case[2]occurs, e.g., if there is a closed timelike curve through po- More precisely, Case[2]is excluded 
if the past distinguishing condition is satisfied at po, i.e., if for q d M. the implication 

riq)^r{po) =^ q^Po (57) 
holds. If the implication ([57]) is true, the following can be shown: 
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(PI) If, along A, the point A(s) is conjugate to A(0), the cut point of A exists and it comes on or 
before A(s). 

(P2) Assume that a point q can be reached from po along two different lightlike geodesies Ai 
and A2 from pQ. Then the cut point of Ai and of A2 exists and it comes on or before q. 

(P3) If the cut locus of a past light cone is empty, this past light cone is an embedded submanifold 
of X. 

For proofs see f336] ; The proofs can also be found in or easily deduced from [32] . Statement |(P1)| 
says that conjugate points and cut points are related by the easily remembered rule "the cut 
point comes first" . Statement |(P2)| says that a "cut" between two geodesies is indicated by the 
occurrence of a cut point. However, it does not say that exactly at the cut point a second geodesic 
is met. Such a stronger statement, which truly justifies the name "cut point", holds in globally 
hyperbolic spacetimes (see Section [O]) . Statement |(P3)| implies that the occurrence of transverse 
self-intersections of a light cone are always indicated by cut points. Note, however, that transverse 
self-intersections of the past light cone of po may occur inside I~{po) and, thus, far away from 
the cut locus. 

Statement |(P1)| implies that dI~{po) is an immersed submanifold everywhere except at the 
cut locus and, of course, at the vertex po- It is known (see [193 , Proposition 6.3.1) that dI~{po) 
is achronal (i.e., it is impossible to connect any two of its points by a timelike curve) and thus a 
3-dimensional Lipschitz topological submanifold. By a general theorem of Rademacher (see [143) . 
Theorem 3.6.1), this implies that dI~{po) is differentiable almost everywhere, i.e., that the cut 
locus has measure zero in dI~{po). Note that this argument does not necessarily imply that the 
cut locus is a "small" subset of dI~{po). Chrusciel and Galloway 79 have demonstrated, by way 
of example, that an achronal subset ^ of a spacetime may fail to be differentiable on a set that is 
dense in A. So our reasoning so far does not even exclude the possibility that the cut locus is dense 
in an open subset of dI~{po)- This possibility can be excluded in globally hyperbolic spacetimes 
where the cut locus is always a closed subset of (see Section l3T1) . In general, the cut locus need 
not be closed as is exemplified by Figure (23 

In Section 12.81 we investigate the relevance of cut points (and conjugate points) for multiple 
imaging. 

2.8 Criteria for multiple imaging 

To investigate whether multiple imaging occurs in a spacetime (A^,^), we choose any point po 
(observation event) and any timelike curve 73 (wordline of light source) in Ai. The following cases 
are possible: 

1. There is no past-pointing lightlike geodesic from po to 73. Then the observer at po does not 
see any image of the light source 73. For instance, this occurs in Minkowski spacetime for an 
inextendible worldline 73 that asymptotically approaches the past light cone of po ■ 

2. There is exactly one past-pointing lightlike geodesic from pQ to 73. Then the observer at ps 
sees exactly one image of the light source 73 . This is the situation naively taken for granted 
in pre-relativistic astronomy. 

3. There are at least two but not more than denumerahly many past-pointing lightlike geodesies 
from Po to 7s . Then the observer at po sees finitely or infinitely many distinct images of 70 
at his or her celestial sphere. 

4. There are more than denumerahly many past-pointing lightlike geodesies from p to 7. This 
happens, e.g., in rotationally symmetric situations where it gives rise to the so-called "Ein- 



29 



stein rings" (see Section I4.3|) . It also happens, e.g., in plane- wave spacetimes (see Sec- 
tion EjH). 

If Case [3] or m occurs, astronomers speak of multiple imaging. We first demonstrate that Case HI 
is exceptional. It is easy to prove (see, e.g., |336] , Proposition 12) that no finite segment of the 
timelike curve 73 can be contained in the past light cone of po- Thus, if there is a continuous 
one-parameter family of lightlike geodesies that connect po and 70, then all family members 
meet 7s at the same point, say ps- This point must be in the caustic of the light cone because 
through all non-caustic points there is only a discrete number of generators. One can always find 
a point p'q arbitrarily close to po such that 73 does not meet the caustic of the past light cone 
of Po (see, e.g., [336] . Proposition 10). Hence, by an arbitrarily small perturbation of po one can 
always destroy a Cased] situation. One may interpret this result as saying that Case ]!] situations 
have zero probability. This is, indeed, true as long as we consider point sources (worldlines). The 
observed rings and arcs refer to extended sources (worldtubes) which are close to the caustic (recall 
Section 1^75]) . Such situations occur with non-zero probability. 

We will now show how multiple imaging is related to the notion of cut points (recall Section [277]) . 
For any point po in an arbitrary spacetime, the following criteria for multiple imaging hold: 

(CI) Let A be a past-pointing lightlike geodesic from po and let ps be a point on A beyond the 
cut point or beyond the first conjugate point. Then there is a timelike curve 73 through 
PS that can be reached from po along a second past-pointing lightlike geodesic. 

(C2) Assume that at po the past-distinguishing condition ()57p is satisfied. If a timelike curve 73 
can be reached from po along two different past-pointing lightlike geodesies, at least one 
of them passes through the cut locus of the past light cone of po on or before arriving at 
7s- 

For proofs see |335] or [336] . (In [335] Criterion ](C2)] is formulated with the strong causality 
condition, although the past-distinguishing condition is sufficient.) Criteria ](C1)1 and ](C2)] say 
that the occurrence of cut points is sufficient and, in past-distinguishing spacetimes, also necessary 
for multiple imaging. The occurrence of conjugate points is sufficient but, in general, not necessary 
for multiple imaging (see Figure|25]for an example without conjugate points where multiple imaging 
occurs). So we have the following diagram: 



Occurrence of: 


Sufficient for multiple imaging in: 


Necessary for multiple imaging in: 


cut point 
conjugate point 


arbitrary spacetime 
arbitrary spacetime 


past-distinguishing spacetime 



It is well known (see |193] . in particular Proposition 4.4.5) that, under conditions which are to 
be considered as fairly general from a physical point of view, a lightlike geodesic must either be 
incomplete or contain a pair of conjugate points. These "fairly general conditions" are, e.g., the 
weak energy condition and the so-called generic condition (see |193j for details) . This result implies 
the occurrence of conjugate points and, thus, of multiple imaging, for a large class of spacetimes. 

The occurrence of conjugate points has an important consequence in view of the focusing 
equation for the area distance Darea (recall Section [2.41 and, in particular. Equation (|44]) ). As 
Daxca vanishes at the vertex s = and at each conjugate point, there must be a parameter value 
Sm with I3aroa(sm) = between the vertex and the first conjugate point. An elementary evaluation 
of the focusing equation ()44p then implies 



1 < 



Wis)? 



iRic 



(X{s)/\{s 



ds. 



(58) 
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As the Ricci term is related to the energy density via Einstein's field equation, ([58]) gives an 
estimate of energy-density-plus-shear along the ray. If we observe a multiple imaging situation, 
and if we know (or assume) that we are in a situation where conjugate points are necessary for 
multiple imaging, we have thus an estimate on energy-density-plus-shear along the ray. This line 
of thought was worked out, under additional assumptions on the spacetime, in |318J . 

2.9 Fermat's principle for light rays 

It is often advantageous to characterize light rays by a variational principle, rather than by a 
differential equation. This is particularly true in view of applications to lensing. If we have chosen 
a point po (observation event) and a timelike curve 7s (worldline of light source) in spacetime A4, 
we want to determine all past-pointing lightlike geodesies from po to 73. When working with a 
differential equation for light rays, we have to calculate all light rays issuing from po into the past, 
and to see which of them meet 73. If we work with a variational principle, we can restrict to curves 
from po to 7s at the outset. 

To set up a variational principle, we have to choose the trial curves among which the solution 
curves are to be determined and the functional that has to be extremized. Let /3po,7s denote the 
set of all past-pointing lightlike curves from po to 73. This is the set of trial curves from which 
the lightlike geodesies are to be singled out by the variational principle. Choose a past-oriented 
but otherwise arbitrary parametrization for the timelike curve 73 and assign to each trial curve 
the parameter at which it arrives. This gives the arrival time funetional T : Cp^^-y^ — > R that is 
to be extremized. With respect to an appropriate differentiability notion for T, it turns out that 
the critical points (i.e., the points where the differential of T vanishes) are exactly the geodesies 
in Cp^,j^. This result (or its time-reversed version) can be viewed as a general- relativistic Fermat 
principle: 

Among all ways to move from po to 73 in the past-pointing (or future-pointing) direc- 
tion at the speed of light, the actual light rays choose those paths that make the arrival 
time stationary. 

This formulation of Fermat's principle was suggested in 1990 by Kovner |240J . a local version 
(restricted to a convex normal neighborhood) can be found already in a 1938 paper by Temple [401] . 
The crucial idea is to refer to the arrival time which is given only along the curve 73, and not to 
some kind of global time which in an arbitrary spacetime does not even exist. The proof that 
the solution curves of Kovner's variational principle are, indeed, exactly the lightlike geodesies 
was given in [332) . The proof can also be found, with a slight restriction on the spacetime that 
simplifies matters considerably, in [367] . An alternative version, based on making -Cpo-Ts i'^^o a 
Hilbert manifold, is given in [334]. 

As in ordinary optics, the light rays make the arrival time stationary but not necessarily mini- 
mal. A more detailed investigation shows that for a geodesic A € ^pots the following holds. (For 
the notion of conjugate points see Sections 12.21 and 12.71 ) 

(Al) If along A there is no point conjugate to po, A is a strict local minimum of T. 

(A2) If A passes through a point conjugate to pq before arriving at 7, it is a saddle of T. 

(A3) If A reaches the first point conjugate to po exactly on its arrival at 73, it may be a local 
minimum or a saddle but not a local maximum. 

For a proof see |332] or [334] . The fact that local maxima cannot occur is easily understood from 
the geometry of the situation: For every trial curve we can find a neighboring trial curve with 
a larger T by putting "wiggles" into it, preserving the lightlike character of the curve. Also for 
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Fermat's principle in ordinary optics, where light propagation is characterized by a positive index 
of refraction on Euclidean 3-space, the extremum is never a local maximum, as is mentioned, e.g., 
in Born and Wolf j46) . p. 137. Note, however, that in the quasi- Newtonian lensing approximation 
with one or more deflector planes, where only broken straight lines are allowed as trial paths, local 
maxima do occur, see, e.g., [343] . Also, in the general formalism of ray optics, where the rays are 
the solutions of Hamilton's equations with an unspecified Hamiltonian, local maxima do occur, 
unless a certain regularity condition is imposed on the Hamiltonian [337j . 

The advantage of Kovner's version of Fermat's principle is that it works in an arbitrary space- 
time. In particular, the spacetime need not be stationary and the light source may arbitrarily move 
around (at subluminal velocity, of course). This allows applications to dynamical situations, e.g., 
to lensing by gravitational waves (see Section fS. 11 1) . If the spacetime is stationary or conformally 
stationary, and if the light source is at rest, a purely spatial reformulation of Fermat's principle 
is possible. This more specific version of Femat's principle is known since decades and has found 
various applications to lensing (see Section [4.2(1 . A more sophisticated application of Fermat's 
principle to lensing theory is to put up a Morse theory in order to prove theorems on the possible 
number of images. In its strongest version, this approach has to presuppose a globally hyperbolic 
spacetime and will be reviewed in Section 13.31 

For a generalization of Kovner's version of Fermat's principle to the case that observer and 
light source have a spatial extension see [340] . 

An alternative variational principle was introduced by Frittelli and Newman [154] and evaluated 
in |155[ll53j . While Kovner's principle, like the classical Fermat principle, is a varional principle for 
rays, the Frittelli-Newman principle is a variational principle for wave fronts. (For the definition 
of wave fronts see Section 12.21 ) Although Frittelli and Newman call their variational principle 
a version of Fermat's principle, it is actually closer to the classical Huygens principle than to 
the classical Fermat principle. Again, one fixes po and 73 as above. To define the trial maps, 
one chooses a set yV(po) of wave fronts, such that for each lightlike geodesic through po there 
is exactly one wave front in yV(po) that contains this geodesic. Hence, yV(po) is in one-to-one 
correspondence to the lightlike directions at po and thus to the 2-sphere. Now let yV(poj 7s) denote 
the set of all wave fronts in W{po) that meet 73. We can then define the arrival time functional 
T : yV(po,7s) — > K by assigning to each wave front the parameter value at which it intersects 
73 . There are some cases to be excluded to make sure that T is defined on an open subset of 
W(po) — S"^ 1 single-valued and differentiable. If this is the case, one finds that T is stationary at 
W € W{po) if and only if W contains a lightlike geodesic from po to 73. Thus, to each image of 
73 on the sky of po there corresponds a critical point of T. The great technical advantage of the 
Frittelli-Newman principle over the Kovner principle is that T is defined on a finite dimensional 
manifold, directly to be identified with (part of) the observer's celestial sphere. The arrival time 
T in the Frittelli-Newman approach is directly analogous to the "Fermat potential" in the quasi- 
Newtonian formalism which is discussed, e.g., in [367) . In view of applications, a crucial point is 
that the space yV(po) is a matter of choice; there are many wave fronts which have one light ray 
in common. There is a natural choice, e.g., in asymptotically simple spacetimes (see Section [3. 4p . 

Frittelli, Newman, and collaborators have used their variational principle in combination with 
the exact lens map (recall Section 12. ip to discuss thick and thin lens models from a spacetime 
perspective [155[ 1153] . Methods from differential topology or global analysis, e.g., Morse theory, 
have not yet been applied to the Frittelli-Newman principle. 
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3 Lensing in Globally Hyperbolic Spacetimes 



In a globally hyperbolic spacetime, considerably stronger statements on qualitative lensing features 
can be made than in an arbitrary spacetime. This includes, e.g., multiple imaging criteria in terms 
of cut points or conjugate points, and also applications of Morse theory. The value of these results 
lies in the fact that they hold in globally hyperbolic spacetimes without symmetries, where lensing 
cannot be studied by explicitly integrating the lightlike geodesic equation. 

The most convenient formal definition of global hyperbolicity is the following. In a spacetime 
{A4,g), a subset C of is called a Cauchy surface if every inextendible causal (i.e., timelike or 
lightlike) curve intersects C exactly once. A spacetime is globally hyperbolic if and only if it admits 
a Cauchy surface. The name globally hyperbolic refers to the fact that for hyperbolic differential 
equations, like the wave equation, existence and uniqueness of a global solution is guaranteed for 
initial data given on a Cauchy surface. For details on globally hyperbolic spacetimes see, e.g., 
|193l [22] . It was demonstrated by Geroch |165) that every gobally hyperbolic spacetime admits 
a continuous function t : M. — >■ M such that t~^{to) is a Cauchy surface for every to G K. A 
complete proof of the fact that such a Cauchy time function can be chosen differentiable was given 
much later by Bernal and Sanchez [SH |35l |36] . 

The topology of a globally hyperbolic spacetime is determined by the topology of any of its 
Cauchy surfaces, ~ C x M. Note, however, that the converse is not true because Ci x R may 
be homeomorphic (and even diffeomorphic) to C2 x R without Ci being homeomorphic to C2. For 
instance, one can construct a globally hyperbolic spacetime with topology M"* that admits a Cauchy 
surface which is not homeomorphic to [305J . 

In view of applications to lensing the following observation is crucial. If one removes a point, 
a worldline (timelike curve), or a world tube (region with timelike boundary) from an arbitrary 
spacetime, the resulting spacetime cannot be globally hyperbolic. Thus, restricting to globally 
hyperbolic spacetimes excludes all cases where a deflector is treated as non-transparent by cutting 
its world tube from spacetime (see Figure [25] for an example). Note, however, that this does 
not mean that globally hyperbolic spacetimes can serve as models only for transparent deflectors. 
First, a globally hyperbolic spacetime may contain "non-transparent" regions in the sense that a 
light ray may be trapped in a spatially compact set. Second, the region outside the horizon of a 
(Schwarzschild, Kerr, . . . ) black hole is globally hyperbolic. 

3.1 Criteria for multiple imaging in globally hyperbolic spacetimes 

In Section [2. 71 we have considered the past light cone of an event po in an arbitrary spacetime. We 
have seen that conjugate points (= caustic points) indicate that the past light cone fails to be an 
immersed submanifold and that cut points indicate that it fails to be an embedded submanifold. 
In a globally hyperbolic spacetime (A^,g), the following additional statements are true. 

(HI) The past light cone of any event po, together with the vertex {po}i is closed in M.. 

(H2) The cut locus of the past light cone of po is closed in M. 

(113) Let PS be in the cut locus of the past light cone of po but not in the conjugate locus 
(= caustic). Then pg can be reached from po along two different lightlike geodesies. The 
past light cone of po has a transverse self-intersection at ps . 

(H4) The past light cone of po is an embedded submanifold if and only if its cut locus is empty. 

Analogous results hold, of course, for the future light cone, but the past version is the one that 
has relevance for lensing. For proofs of these statements see [32], Propositions 9.35 and 9.29 and 
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Theorem 9.15, and [336], Propositions 13, 14, and 15. According to Statenient |(H3)[ a "cut point" 
indicates a "cut" of two lightlike geodesies. For geodesies in Riemannian manifolds (i.e., in the 
positive definite case) , an analogous statement holds if the Riemannian metric is complete and is 
known as Poincare theorem [3491 1434] . It was this theorem that motivated the name "cut point". 
Note that Statement |(H3)| is not true without the assumption that ps is not in the caustic. This 
is exemplified by the swallow-tail point in Figure [26] However, as points in the caustic of the past 
light cone of po can be reached from po along two "infinitesimally close" lightlike geodesies, the 
name "cut point" may be considered as justified also in this case. 

In addition to Statements [(HI) | and |(H2)] one would like to know whether in globally hyperbolic 
spactimes the caustic of the past light cone of po (also known as the past lightlike conjugate locus 
of po) is closed. 

This question is closely related to the question of whether in a complete Riemannian manifold 
the conjugate locus of a point is closed. For both questions, the answer was widely believed to be 
'yes' although actually it is 'no'. To the surprise of many, Margerin |269] constructed Riemannian 
metrics on the 2-sphere such that the conjugate locus of a point is not closed. Taking the product 
of such a Riemannian manifold with 2-dimensional Minkowski space gives a globally hyperbolic 
spacetime in which the caustic of the past light cone of an event is not closed. 

In Section [2.81 we gave criteria for the number of past-oriented lightlike geodesies from a point 
Po (observation event) to a timelike curve 73 (worldline of a light source) in an arbitrary spacetime. 
With Statements |(IIl)[ |(H2)[ [(113)] and |(H4)] at hand, the following stronger criteria can be given. 

Let {Ai,g) be globally hyperbolic, fix a point po and an inextendible timelike curve 73 in A^. 
Then the following is true: 

(H5) Assume that 73 enters into the chronological past I^{po) of po- Then there is a past- 
oriented lightlike geodesic A from po to 73 that is completely contained in the boundary 
of I~{po)- This geodesic does not pass through a cut point or through a conjugate point 
before arriving at 73. 

(H6) Assume that 73 can be reached from po along a past-oriented lightlike geodesic that passes 
through a conjugate point or through a cut point before arriving at 73. Then 73 can be 
reached from po along a second past-oriented lightlike geodesic. 

Statement |(H5)| was proven in [408] with the help of Morse theory. For a more elementary proof 
see |336 , Proposition 16. Statement [ (H5) | gives a characterization of the primary image in globally 
hyperbolic spacetimes. (By definition, an image is "primary" if no other image shows the light 
source at an older age.) The condition of 73 entering into the chronological past of po is necessary 
to exclude the case that po sees no image of 73. Statement [(H5)| implies that there is a unique 
primary image unless 73 passes through the cut locus of the past light cone of po- The primary 
image has even parity. If the weak energy condition is satisfied, the focusing theorem implies that 
the primary image has magnification factor > 1, i.e., that it appears brighter than a source of 
the same luminosity at the same affine distance and at the same redshift in Minkowski spacetime 
(recall Sections |2 .41 and \2M in particular the inequality ([46]) ). 
For a proof of Statement |(H6)| see [336], Proposition 17. 

3.2 Wave fronts in globally hyperbolic spacetimes 

In Section 12.21 the notion of wave fronts was discussed in an arbitrary spacetime {A4,g). It was 
mentioned that a wave front can be viewed as a subset of the space J\f of all lightlike geodesies in 
{M,g). This approach is particularly useful in globally hyperbolic spacetimes, as was demonstrated 
by Low |263l 1264] . The construction is based on the observations that, if {M,g) is globally 
hyperbolic and C is a smooth Cauchy surface, the following is true: 
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(Nl) Af can be identified with a sphere bundle over C. The identification is made by assigning to 
each hghthke geodesic its tangent hne at the point where it intersects C. As every sphere 
bundle over an orientable 3- manifold is trivializable, J\f is diffeomorphic to C x S"^. 

(N2) Af carries a natural contact structure. (This contact structure is also discussed, in twistor 
language, in [330) . volume II.) 

(N3) The wave fronts are exactly the Legendre submanifolds of Af. 

Using Statement |(N1)[ the projection from Af to C assigns to each wave front its intersection 
with C, i.e., an "instantaneous wave front" or "small wave front" (cf. Section [2. 2 1 for terminology). 
The points where this projection has non-maximal rank give the caustic of the small wave front. 
According to the general stability results of Arnold (see [2]), the only caustic points that are 
stable with respect to local perturbations within the class of Legendre submanifolds are cusps and 
swallow-tails. By Statement |(N3)[ perturbing within the class of Legendre submanifolds is the 
same as perturbing within the class of wave fronts. For this local stability result the assumption 
of global hyperbolicity is irrelevant because every spacelike hypersurface is a Cauchy surface for 
an appropriately chosen neighborhood of any of its points. So we get the result that was already 
mentioned in Section r2.2l In an arbitrary spacetime, a caustic point of an instantaneous wave front 
is stable if and only if it is a cusp or a swallow-tail. Here stability refers to perturbations that keep 
the metric and the hypersurface fixed and perturb the wave front within the class of wave fronts. 
For a picture of an instantaneous wave front with cusps and a swallow-tail point, see Figure [^Hl In 
Figure [UJ the caustic points are neither cusps nor swallow-tails, so the caustic is unstable. 

3.3 Fermat's principle and Morse theory in globally hyperbolic space- 
times 

In an arbitrary spacetime, the past-oriented lightlike geodesies from a point po (observation event) 
to a timelike curve 7s (worldline of light source) are the solutions of a variational principle (Kovner's 
version of Fermat's principle; see Section |2.9[) . Every solution of this variational principle corre- 
sponds to an image on po 's sky of 73 . Determining the number of images is the same as determining 
the number of solutions to the variational problem. If the variational functional satisfies some tech- 
nical conditions, the number of solutions to the variational principle can be related to the topology 
of the space of trial paths. This is the content of Morse theory. In the case at hand, the "technical 
conditions" turn out to be satisfied in globally hyperbolic spacetimes. 

To briefly review Morse theory, we consider a differentiable function F : X — !• M on a real 
manifold X . Points where the differential of F vanishes are called critical points of F. A critical 
point is called non- degenerate if the Hessian of F is non-degenerate at this point. F is called a 
Morse function if all its critical points are non-degenerate. In applications to variational problems, 
X is the space of trial maps, F is the functional to be varied, and the critical points of F are the 
solutions to the variational problem. The non-degeneracy condition guarantees that the character 
of each critical point - local minimum, local maximum, or saddle - is determined by the Hessian 
of F at this point. The index of the Hessian is called the Morse index of the critical point. It is 
defined as the maximal dimension of a subspace on which the Hessian is negative definite. At a 
local minimum the Morse index is zero, at a local maximum it is equal to the dimension of X . 

Morse theory was first worked out by Morse [285] for the case that X is finite-dimensional 
and compact (see Milnor '278 for a detailed exposition). The main result is the following. On a 
compact manifold A", for every Morse function the Morse inequalities 

Nk>Bk, fc = 0,1,2,..., (59) 
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and the Morse relation 



fc=0 A;=0 



(-1)^-7V, - V(-l)'''Sfc (60) 



hold true. Here Nk denotes the number of critical points with Morse index k and Bk denotes the 
fcth Beta number of X. Formally, Bk is defined for each topological space X in terms of the fcth 
singular homology space Hk{X) with coefficients in a field F (see, e.g., |104j . p. 32). (The results of 
Morse theory hold for any choice of F.) Geometrically, Bq counts the connected components of X 
and, for k > 1, Bk counts the "holes" in X that prevent a fc-cycle with coefficients in F from being 
a boundary. In particular, if X is contractible to a point, then Bk = for fc > 1. The right-hand 
side of Equation (|60l) is, by definition, the Euler characteristic of X. By compactness of A", all Nk 
and Bk are finite and in both sums of Equation (j60p only finitely many summands are different 
from zero. 

Palais and Smale }319| 1320] realized that the Morse inequalities and the Morse relations are also 
true for a Morse function on a non-compact and possibly infinite-dimensional Hilbert manifold, 
provided that F is bounded below and satisfies a technical condition known as Condition C or 
Palais-Smale condition. In that case, the Nk and Bk need not be finite. 

The standard application of Morse theory is the geodesic problem for Riemannian (i.e., positive 
definite) metrics: given two points in a Riemannian manifold, to find the geodesies that join them. 
In this case F is the "energy functional" (squared-length functional) . Varying the energy functional 
is related to varying the length functional like Hamilton's principle is related to Maupertuis' prin- 
ciple in classical mechanics. For the space X one chooses, in the Palais-Smale approach [319] . the 
iJ-'^-curves between the given two points. (An i/"-curve is a curve with locally square-integrable 
nth derivative). This is an infinite-dimensional Hilbert manifold. It has the same homotopy type 
(and thus the same Betti numbers) as the loop space of the Riemannian manifold. (The loop 
space of a connected topological space is the space of all continuous curves joining any two fixed 
points.) On this Hilbert manifold, the energy functional is always bounded from below, and its 
critical points are exactly the geodesies between the given end-points. A critical point (geodesic) 
is non-degenerate if the two end-points are not conjugate to each other, and its Morse index is the 
number of conjugate points in the interior, counted with multiplicity ("Morse index theorem"). 
The Palais-Smale condition is satisfied if the Riemannian manifold is complete. So one has the 
following result: Fix any two points in a complete Riemannian manifold that are not conjugate to 
each other along any geodesic. Then the Morse inequalities (|59l) and the Morse relation (|60l) are 
true, with Nk denoting the number of geodesies with Morse index k between the two points and 
Bk denoting the fcth Betti number of the loop space of the Riemannian manifold. The same result 
is achieved in the original version of Morse theory |285j (cf. |278) ) by choosing for X the space of 
broken geodesies between the two given points, with TV break points, and sending iV cx) at the 
end. 

Using this standard example of Morse theory as a pattern, one can prove an analogous result 
for Kovner's version of Fermat's principle. The following hypotheses have to be satisfied: 

(Ml) po is a point and 7s is a timelike curve in a globally hyperbolic spacetime {Ai,g). 

(M2) 7s does not meet the caustic of the past light cone of po- 

(M3) Every continuous curve from po to 7s can be continuously deformed into a past-oriented 
lightlike curve, with all intermediary curves starting at po and terminating on 73. 

The global hyperbolicity assumption in Statement |(Ml)| is analogous to the completeness assump- 



tion in the Riemannian case. Statement (M2) is the direct analogue of the non-conjugacy condition 



in the Riemannian case. Statement (M3) is necessary for relating the space of trial paths (i.e. 



of past-oriented lightlike curves from po to 73) to the loop space of the spacetime manifold or, 
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equivalently, to the loop space of a Cauchy surface. If Statements |(M1)[ |(M2)[ and |(M3)] are valid, 
the Morse inequalities ((5^)) and the Morse relation (|60p are true, with Nk denoting the number of 
past-oriented lightlike geodesies from po to 73 that have k conjugate points in its interior, counted 
with muliplicity, and Bk denoting the fcth Betti number of the loop space of A4 or, equivalently, 
of a Cauchy surface. This result was proven by Uhlenbeck [408, a la Morse and Milnor, and by 
Giannoni and Masiello |170] in an infinite-dimensional Hilbert manifold setting a la Palais and 
Smale. A more general version, applying to spacetime regions with boundaries, was worked out by 
Giannoni, Masiello, and Piccione jlTll 1172] . In the work of Giannoni et al., the proofs are given 
in greater detail than in the work of Uhlenbeck. 

If Statements |(M1)[ |(M2)[ and |(M3)| are satisfied, Morse theory gives us the following results 
about the number of images of 7s on the sky of po (cf. [274] ): 

(Rl) If M is not contractible to a point, there are infinitely many images. This follows from 
Equation (|59l) because for the loop space of a non-contractible space either Bq is infinite 
or almost all Bk are different from zero [3781 . 

(R2) If Ai is contractible to a point, the total number of images is infinite or odd. This follows 
from Equation (|60p because in this case the loop space of A4 is contractible to a point, so all 
Betti numbers B^ vanish with the exception of i?o = 1- As a consequence. Equation (j60p can 
be written as 7V+ — N- = 1, where is the number of images with even parity (geodesies 
with even Morse index) and N- is the number of images with odd parity (geodesies with 
odd Morse index), hence N+ + N- ^ 2N- + 1. 

These results apply, in particular, to the following situations of physical interest: 
Black hole spacetimes. 

Let {A4, g) be the domain of outer communication of the Kerr spacetime, i.e., the region between 
the (outer) horizon and infinity (see Section l5.8p . Then the assumption of global hyperbolicity is 
satisfied and A4 is not contractible to a point. Statement |(M3)| is satisfied if 7s is inextendible 
and approaches neither the horizon nor (past lightlike) infinity for t — > —00. (This can be checked 
with the help of an analytical criterion that is called the "metric growth condition" in [408] .) If, in 
addition Statement | (M2) | is satisfied, the reasoning of Statement |(R1)| applies. Hence, a Kerr black 
hole produces infinitely many images, under fairly generic conditions on the motion of the light 
source. The details of this argument are worked out, for the more general case of a Kerr-Newman 
black hole, in [TH]. 

Asymptotically simple and empty spacetimes. 

As discussed in Section 13.41 asymptotically simple and empty spacetimes are globally hyperbolic 
and contractible to a point. They can be viewed as models of isolated transparent gravitational 
lenses. Statement | (M3) | is satisfied if 73 is inextendible and bounded away from past lightlike infin- 
ity y~ . If, in addition. Statement |(M2)| is satisfied. Statement [(R2)] guarantees that the number of 
images is infinite or odd. If it were infinite, we had as the limit curve a past-inextendible lightlike 
geodesic that would not go out to ■y~ , in contradiction to the definition of asymptotic simplicity. 
So the number of images must be finite and odd. The same odd-number theorem can also be 
proven with other methods (see Section [3^ . 

In this way Morse theory provides us with precise mathematical versions of the statements 
"A black hole produces infinitely many images" and "An isolated transparent gravitational lens 
produces an odd number of images" . When comparing this theoretical result with observations 
one has to be aware of the fact that some images might be hidden behind the deflecting mass, some 
might be too faint for being detected, and some might be too close together for being resolved. 
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In conformally stationary spacetimes, with 73 being an integral curve of the conformal Killing 
vector field, a simpler version of Fermat's principle and Morse theory can be used (see Section H?^ . 

3.4 Lensing in asymptotically simple and empty spacetimes 

In elementary optics one often considers "light sources at infinity" which arc characterized by the 
fact that all light rays emitted from such a source are parallel to each other. In general relativity, 
"light sources at infinity" can be defined if one restricts to a special class of spacetimes. These 
spacetimes, known as "asymptotically simple and empty" are, in particular, globally hyperbolic. 
Their formal definition, which is due to Penrose ,326j . reads as follows (cf. 193 , p. 222., and 1148] . 
Section 2.3). (Recall that a spacetime is called "strongly causal" if each neighborhood of an event 
p admits a smaller neighborhood that is intersected by any non-spacelike curve at most once.) 

A spacetime {A4,g,) is called asymptotically simple and empty if there is a strongly causal 
spacetime (Ai^g) with the following properties: 

(51) A4 is an open submanifold of Ai with a non-empty boundary dAi. 

(52) There is a smooth function fl : M — > M. such that M = {p e M\^l{p) > 0}, dM = {p G 
M\n{p) — 0}, dfl ^ everywhere on dM and g = fl^g on Ai. 

(53) Every inextendible lightlike geodesic in Ai has past and future end-point on dM. 

(54) There is a neighborhood V of DM such that the Ricci tensor of g vanishes on V O M. 

Asymptotically simple and empty spacetimes are mathematical models of transparent uncharged 
gravitating bodies that are isolated from all other gravitational sources. In view of lensing, the 
transparency condition | ( S3) | is particularly important. 

We now summarize some well-known facts about asymptotically simple and empty spacetimes 
(cf. again |193j . p. 222, and |148J . Section 2.3). Every asymptotically simple and empty spacetime 
is globally hyperbolic. dM is a 5-lightlike hypersurface of A^. It has two connected components, 
denoted and . Each lightlike geodesic in (A^ , g) has past end-point on and future 
end-point on . Geroch |166| gave a proof that every Cauchy surface C of an asymptotically 
simple and empty spacetime has topology and that y^ has topology S'^ x K. The original proof, 
which is repeated in |193j . is incomplete. A complete proof that C must be contractible and that 
y^ has topology S*^ x R was given by Newman and Clarke [305] (cf. [304] ): the stronger statement 
that C must have topology R'^ needs the assumption that the Poincare conjecture is true (i.e., that 
every compact and simply connected 3-manifold is a 3-sphere). In |305| the authors believed that 
the Poincare conjecture was proven, but the proof they are refering to was actually based on an 
error. As the more recent proof of the Poincare conjecture by Perelman |331| (cf. |281| ) has been 
generally accepted as being correct, the matter is now settled. 

As y^ is a lightlike hypersurface in , it is in particular a wave front in the sense of Section [2?2l 
The generators of are the integral curves of the gradient of fl. The generators of can be 
interpreted as the "worldlines" of light sources at infinity that send light into M . The generators 
of y+ can be interpreted as the "worldlines" of observers at infinity that receive light from M. 
This interpretation is justified by the observation that each generator of J^^ is the limit curve for 
a sequence of timelike curves in M . 

For an observation event po inside M and light sources at infinity, lensing can be investigated 
in terms of the exact lens map (recall Section 12. 1[) , with the role of the source surface T played by 
y^ . (For the mathematical properties of the lens map it is rather irrelevant whether the source 
surface is timelike, lightlike or even spacelike. What matters is that the arriving light rays meet 
the source surface transversely.) In this case the lens map is a map S'^ — > S"^, namely from the 
celestial sphere of the observer to the set of all generators of y~ . One can construct it in two 
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steps: First determine the intersection of the past light cone of po with , then project along 
the generators. The intersections of light cones with J^^ ("light cone cuts of null infinity") have 
been studied in [MIIMI]- 

One can assign a mapping degree (= Brouwer degree = winding number) to the lens map S'^ — > 
S*^ and prove that it must be ±1 |338j . (The proof is based on ideas of j305| 1304] . Earlier proofs 
of similar statements - |241J . Lemma 1, and |336j . Theorem 6 - are incorrect, as outlined in [338].) 
Based on this result, the following odd- number theorem can be proven for observer and light source 
inside M |338j : Fix a point po and a timelike curve 73 in an asymptotically simple and empty 
spacetime {M,g). Assume that the image of 73 is a closed subset oi A4\ J^~^ and that 73 meets 
neither the point po nor the caustic of the past light cone of po ■ Then the number of past-pointing 
lightlike geodesies from po to 73 in Ai is finite and odd. The same result can be proven with the 
help of Morse theory (see Section 13. 3p . 

We will now give an argument to the effect that in an asymptotically simple and empty space- 
time the non-occurrence of multiple imaging is rather exceptional. The argument starts from a 
standard result that is used in the Penrose-Hawking singularity theorems. This standard result, 
given as Proposition 4.4.5 in [193J . says that along a lightlike geodesic that starts at a point po 
there must be a point conjugate to po, provided that 

1. the so-called generic condition is satisfied at po, 

2. the weak energy condition is satisfied along the geodesic, and 

3. the geodesic can be extended sufficiently far. 

The last assumption is certainly true in an asymptotically simple and empty spacetime because 
there all lightlike geodesies are complete. Hence, the generic condition and the weak energy 
condition guarantee that every past light cone must have a caustic point. We know from Section [3. II 
that this implies multiple imaging for every observer. In other words, the only asymptotically 
simple and empty spacetimes in which multiple imaging does not occur are non-generic cases (like 
Minkowski spacetime) and cases where the gravitating bodies have negative energy. 

The result that, under the aforementioned conditions, light cones in an asymptotically simple 
and empty spacetime must have caustic points is due to |208) . This paper investigates the past 
light cones of points on J^"*" and their caustics. These light cones are the generalizations, to an 
arbitrary asymptotically simple and empty spacetime, of the lightlike hyperplanes in Minkowski 
spacetime. With their help, the eikonal equation (Hamilton- Jacobi equation) g^^diSdjS = in an 
asymptotically simple and empty spacetime can be studied in analogy to Minkowski spacetime |157[ 
1156) . In Minkowski spacetime the lightlike hyperplanes are associated with a two-parameter family 
of solutions to the eikonal equation. In the terminology of classical mechanics such a family is called 
a complete integral. Knowing a complete integral allows constructing all solutions to the Hamilton- 
Jacobi equation. In an asymptotically simple and empty spacetime the past light cones of points 
on give us, again, a complete integral for the eikonal equation, but now in a generalized sense, 
allowing for caustics. These past light cones are wave fronts, in the sense of Section and cannot 
be represented as surfaces .S* = constant near caustic points. The way in which all other wave fronts 
can be determined from knowledge of this distinguished family of wave fronts is detailed in |156) . 
The distinguished family of wave fronts gives a natural choice for the space of trial maps in the 
Frittelli-Newman variational principle which was discussed in Section 12.91 
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4 Lensing in Spacetimes with Symmetry 



4.1 Lensing in conformally flat spacetimes 

By definition, a spacetime is conformally flat if the conformal curvature tensor (= Weyl tensor) 
vanishes. An equivalent condition is that every point admits a neighborhood that is conformal to an 
open subset of Minkowski spacetime. As a consequence, conformally flat spacetimes have the same 
local conformal symmetry as Minkowski spacetime, that is they admit 15 independent conformal 
Killing vector fields. The global topology, however, may be different from the topology of Minkowski 
spacetime. The class of conformally flat spacetimes includes all (kinematic) Robertson- Walker 
spacetimes. Other physically interesting examples are some (generalized) interior Schwarzschild 
solutions and some pure radiation spacetimes. All conformally fiat solutions to Einstein's field 
equation with a perfect fluid or an electromagnetic field are known (see [388] . Section 37.5.3). 

If a spacetime is globally conformal to an open subset of Minkowski spacetime, the past light 
cone of every event is an embedded submanifold. Hence, multiple imaging cannot occur (recall 
Section 12.81) . For instance, multiple imaging occurs in spatially closed but not in spatially open 
Robertson- Walker spacetimes. In any conformally fiat spacetime, there is no image distortion, i.e., 
a sufficiently small sphere always shows a circular outline on the observer's sky (recall Section [2.5|) . 
Correspondingly, every infinitesimally thin bundle of light rays with a vertex is circular, i.e., the 
extremal angular diameter distances D+ and D- coincide (recall Section [2^ . In addition, D+ = 
D- also coincides with the area distance i^arca, at least up to sign. D+ = D- changes sign at every 
caustic point. As has a zero if and only if Z3__ has a zero, all caustic points of an infinitesimally 
thin bundle with vertex are of multiplicity two (anastigmatic focusing), so all images have even 
parity. 

The geometry of light bundles can be studied directly in terms of the Jacobi equation (= equation 
of geodesic deviation) along lightlike geodesies. For a detailed investigation of the latter in confor- 
mally fiat spacetimes, see [34 Ij . The more special case of Friedmann-Lemaitre-Robertson- Walker 
spacetimes (with dust, radiation, and cosmological constant) is treated in |131| . For bundles with 
vertex, one is left with one scalar equation for = £)_ = ±_Darca7 that is the focusing equa- 
tion ([44p with (7 = 0. This equation can be explicitly integrated for Friedmann-Robertson- Walker 
spacetimes (dust without cosmological constant). In this way one gets, for the standard observer 
field in such a spacetime, relations between redshift and (area or luminosity) distance in closed 
form [273] . There are generalizations for a Robertson- Walker universe with dust plus cosmological 
constant [228] and dust plus radiation plus cosmological constant [94]. Similar formulas can be 
written for the relation between age and redshift |403] . 

4.2 Lensing in conformafly stationary spacetimes 

Conformally stationary spacetimes are models for gravitational fields that are time-independent 
up to an overall conformal factor. (The time-dependence of the conformal factor is important, e.g., 
if cosmic expansion is to be taken into account.) This is a reasonable model assumption for many, 
though not all, lensing situations of interest. It allows describing light rays in a 3-dimensional (spa- 
tial) formalism that will be outlined in this section. The class of conformally stationary spacetimes 
includes spherically symmetric and static spacetimes (see Sections 14.3]) and axisymmetric station- 
ary spacetimes (see Section l4^ . Also, conformally flat spacetimes (see Section l4T]) arc confor- 
mally stationary, at least locally. A physically relevant example where the conformal-stationarity 
assumption is not satisfied is lensing by a gravitational wave (see Section [5. 11]) . 

By definition, a spacetime is conformally stationary if it admits a timelike conformal Killing 
vector field W. If W is complete and if there are no closed timelike curves, the spacetime must 
be a product, ~ R x with a (Hausdorff and paracompact) 3- manifold Ai and W parallel 
to the M-lines [185] . If we denote the projection from M to R hy t and choose local coordinates 
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X = on Ai, the metric takes the form 

g = e^/(*'^) (^-{dt + 4>^{x) dxf")^ + g^^{x) dx^ dx"^ (61) 

with iijV,... = 1,2,3. The conformal factor e^^ does not affect the lighthke geodesies apart 
from their parametrization. So the paths of hght rays are completely determined by the metric 
g = gf^^{x)dx'^dx'^ and the one-form cj) = (f>fj^{x)dx^ which live on M. The metric g must be positive 
definite to give a spacetime metric of Lorentzian signature. We call / the redshift potential, g the 
Fermat metric and (j) the Fermat one-form. The motivation for these names will become clear from 
the discussion below. 

If (j)fj_ — df^h, where /i is a function of a; = (x^ ,x'^ ,x^), we can change the time coordinate 
according to t i — t + h{x), thereby transforming ^^dx'^ to zero, i.e., making the surfaces t — 
constant orthogonal to the t-lines. This is the conformally static case. Also, Equation (j6ip includes 
the stationary case (/ independent of t) and the static case (c/)^ = d^h and / independent of t). 

In Section 12.91 we have discussed Kovner's version of Fermat's principle which characterizes 
the lightlike geodesies between a point (observation event) po and a timelike curve (worldline of 
light source) 73. In a conformally stationary spacetime we may specialize to the case that 73 
is an integral curve of the conformal Killing vector field, parametrized by the "conformal time" 
coordinate t (in the past-pointing sense, to be in agreement with Section r2.9p . Without loss of 
generality, we may assume that the observation event po takes place at t — 0. Then for each trial 
path (past-oriented lightlike curve) A from po to 73 the arrival time is equal to the travel time 
in terms of the time function t. By Equation (|61l) this puts the arrival time functional into the 
following coordinate form 




, ^dxf^ dx'^ 7 , ,„ 

where £ is any parameter along the trial path, ranging over an interval [^1,^2] that depends on 
the individual curve. The right-hand side of Equation (j62p is a functional for curves in Ai with 
fixed end-points. The projections to M of light rays are the stationary points of this functional. 
In general, the right-hand side of Equation (|62p is the length functional of a Finsler metric. In 
the conformally static case (j)^ = d^h, the integral over (f)fj_{x)dx^ /di is the same for all trial paths, 
so we are left with the length functional of the Fermat metric g. In this case the light rays, if 
projected to M, are the geodesies of g. Note that the travel time functional ([S^ is invariant under 
reparametrization; in the terminology of classical mechanics, it is a special case of Maupertuis' 
principle. It is often convenient to switch to a parametrization-dependent variational principle 
which, in the terminology of classical mechanics, is called Hamilton's principle. The Maupertuis 
principle with action functional (|62p corresponds to Hamilton's principle with a Lagrangian 

I dx'^dx- ~ dx'^ 

(see, e.g., Caratheodory [73], Sections 304-307). The pertaining Euler-Lagrange equations read 

dx'^ dx^\ /„ 7 „ 7 \ dx'' 



= (64) 



where V'^^ are the Christoffel symbols of the Fermat metric g. The solutions admit the constant 
of motion 

dx^ dx" 

Sf^''^^^~^~Jf " constant, (65) 
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which can be chosen equal to 1 for each ray, such that t gives the g-arclength. By Equation (|62|) . 
the latter gives the travel time if = 0. According to Equation (|64[). the Fermat two-form 

ll> = d(f) (66) 

exerts a kind of Coriolis force on the light rays. This force has the same mathematical structure 
as the Lorentz force in a magnetostatic field. In this analogy, corresponds to the magnetic 
(vector) potential. In other words, light rays in a conformally stationary spacetime behave like 
charged particles, with fixed charge-to-mass ratio, in a magnetostatic field w on a Riemannian 
manifold {M,g). For linearly polarized light, the Fermat geometry can also be used for describing 
the propagation of the polarization plane |189] . One finds that the polarization plane undergoes 
a rotation similar to the Faraday rotation in a magnetic field. This observation corroborates the 
formal analogy between lo and a magnetic field. The gravitational analogue of the Faraday rotation 
was already discussed briefly in Section [^751 above. 

Fermat 's principle in static spacetimes dates back to Weyl [430] (cf. [26011397] ). The stationary 
case was treated by Pham Mau Quan [344] . who even took an isotropic medium into account, 
and later, in a more elegant presentation, by Brill [60]. These versions of Fermat's principle are 
discussed in several text-books on general relativity (see, e.g., |2791I147|I389] for the static and |253] 
for the stationary case). A detailed discussion of the conformally stationary case can be found 
in [333] . Fermat's principle in conformally stationary spacetimes was used as the starting point for 
deriving the lens equation of the quasi-Newtonian apporoximation formalism by Schneider |366] 
(cf. [367] 1. As an alternative to the name "Fermat metric" (used, e.g., in [1471 1389| 1333] ). the 
names "optical metric" (see, e.g., [176| 11051 1175[ 1177] ) and "optical (reference) geometry" (see, 
e.g., [11 12141 [3901 IMlEQillS]) are also used. 

In the conformally static case, one can apply the standard Morse theory for Riemannian 
geodesies to the Fermat metric g to get results on the number of y-geodesics joining two points 
in space. This immediately gives results on the number of lightlike geodesies joining a point in 
spacetime to an integral curve oiW = dt- Completeness of the Fermat metric corresponds to global 
hyperbolicity of the spacetime metric. The relevant techniques, and their generalization to (con- 
formally) stationary spacetimes, are detailed in a book by Masiello |272] . (Note that, in contrast 
to standard terminology, Masiello's definition of a stationary spacetime includes the assumption 
that the hypersurfaces t = constant are spacelike.) The resulting Morse theory is a special case 
of the Morse theory for Fermat's principle in globally hyperbolic spacetimes (see Section [373]) . In 
addition to Morse theory, other standard methods from Riemannian geometry have been applied 
to the Fermat metric, e.g., convexity techniques |173l 1174] . 

If the metric ([6T]) is conformally static, 4>^{x) = d^h{x), and if the Fermat metric is conformal 
to the Euclidean metric, g^u{x) = n{x)'^ S fj_,y , the arrival time functional ([62]) can be written as 

T(A) = / n{x) de + constant, (67) 

where £ is Euclidean arclength. Hence, Fermat's principle reduces to its standard optics form 
for an isotropic medium with index of refraction n on Euclidean space. As a consequence, light 
propagation in a spacetime with the assumed properties can be mimicked by a medium with 
an appropriately chosen index of refraction. This remark applies, e.g., to spherically symmetric 
and static spacetimes (see Section 14.31) and, in particular, to the Schwarzschild spacetime (see 
Section 15. ip . The analogy with ordinary optics in media has been used for constructing, in the 
laboratory, analogue models for light propagation in general- relativistic spacetimes (see [311] ). 

Extremizing the functional ([S71) is formally analogous to Maupertuis' principle for a particle in 
a scalar potential on flat space, which is discussed in any book on classical mechanics. Dropping 
the assumption that the Fermat one-form is a differential, but still requiring the Fermat metric to 
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be conformal to the Euclidean metric, corresponds to introducing an additional vector potential. 
This form of the optical-mechanical analogy, for light rays in stationary spacetimes whose Fermat 
metric is conformal to the Euclidean metric, is discussed, e.g., in [7]. 

The conformal factor e^^ in Equation (|6T|) does not affect the paths of light rays. However, it 
does affect redshifts and distance measures (recall Section [2^ . If g is of the form ([6T|) . for every 
lightlike geodesic A the quantity g{X^ dt) is a constant of motion. This leads to a particularly simple 
form of the general redshift formula p6|) . We consider an arbitrary lightlike geodesic s ^ \{s) in 
terms of its coordinate representation s ^ {t{s) , [s) , [s) , {s)) . If both observer and emitter 
are at rest in the sense that their 4- velocities Uq and C/s are parallel to W = ^t^ Equation (p6|) 
can be rewritten as 

log (1 + z{s)) = / (t(s), x{s)) ~ f (t(0), x{0)) . (68) 

This justifies calling / the redshift potential. It is shown in [188] that there is a redshift potential 
for a congruence of timelike curves in a spacetime if and only if the timelike curves are the integral 
curves of a conformal Killing vector field. The notion of a redshift potential or redshift function is 
also discussed in 97 . Note that Equation (|68|) immediately determines the redshift in conformally 
stationary spacetimes for any pair of observer and emitter. If the 4- velocity of the observer or of 
the emitter is not parallel to W = dt, one just has to add the usual special-relativistic Doppler 
factor. 

Conformally stationary spacetimes can be characterized by another interesting property. Let 
W he a, timelike vector field in a spacetime and fix three observers whose worldlines are integral 
curves of W. Then the angle under which two of them are seen by the third one remains constant 
in the course of time, for any choice of the observers, if and only if W is proportional to a conformal 
Killing vector field. For a proof see |188J . 

4.3 Lensing in spherically symmetric and static spacetimes 

The class of spherically symmetric and static spacetimes is of particular relevance in view of lensing, 
because it includes models for non-rotating stars and black holes (see Sections l5.ll 15.21 l5.3p . but 
also for more exotic objects such as wormholes (see Section [5^ . monopoles (see Section [53]) . 
naked singularities (see Section [5T6|) . and Boson or Fermion stars (see Section IS??]) . A spherically 
symmetric and static spacetime can also be used, as a rough approximation, to model a star cluster, 
a galaxy or a cluster of galaxies. Here we collect the relevant formulas for an unspecified spherically 
symmetric and static metric. We find it convenient to write the metric in the form 

As Equation (|69p is a special case of Equation (|61l) , all results of Section 14.21 for conformally 
stationary metrics apply. However, much stronger results are possible because for metrics of the 
form (|69p the geodesic equation is completely integrable. Hence, all relevant quantities can be 
determined explicitly in terms of integrals over the metric coefficients. 

Redshift and Fermat geometry. 

Comparison of Equation (|69[) with the general form (j6ip of a conformally stationary spacetime 
shows that here the redshift potential / is a function of r only, the Fermat one- form vanishes, 
and the Fermat metric g is of the special form 

g = S{r)^ dr^ + R{r)^ {dd^ -t- sin^ d d(p^) . (70) 

This Fermat metric has several interesting applications. E.g., Gibbons and Werner [1751 have 
derived some lensing features of a spherically symmetric static fluid ball by applying the Gauss- 
Bonnet theorem to the corresponding Fermat metric (or optical metric). By Fermat's principle. 
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the geodesies of g coincide with the projection to 3-space of light rays. The travel time (in terms 
of the time coordinate t) of a lightlike curve coincides with the 5-arclength of its projection. By 
symmetry, every g-geodesic stays in a plane through the origin. From Equation (j70l) we read that 
the sphere of radius r has area 47ri?(r)^ with respect to the Fermat metric. Also, Equation (|70|) 
implies that the second fundamental form of this sphere is a multiple of its first fundamental form, 
with a factor -R'{r) {R{r) 5(r))"\ If 

i?'(rp)=0, (71) 

the sphere r = rp is totally geodesic, i.e., a g-geodesic that starts tangent to this sphere remains 
in it. The best known example for such a light sphere or photon sphere is the sphere r = 3m 
in the Schwarzschild spacetime (see Section 15.11) . Light spheres also occur in the spacetimes of 
wormholes (see Section . If R"{rp) < 0, the circular light rays in a light sphere are stable with 
respect to radial perturbations, and if R"{r-p) > 0, they are unstable like in the Schwarschild case. 
The condition under which a spherically symmetric static spacetime admits a light sphere was first 
given by Atkinson [16) . Abramowicz ^ has shown that for an observer traveling along a circular 
light orbit (with subluminal velocity) there is no centrifugal force and no gyroscopic precession. 
Claudel, Virbhadra, and Ellis [81] investigated, with the help of Einstein's field equation and energy 
conditions, the amount of matter surrounded by a light sphere. Among other things, they found 
an energy condition under which a spherically symmetric static black hole must be surrounded by 
a light sphere. A purely kinematical argument shows that any spherically symmetric and static 
spacetime that has a horizon at r = rjj and is asymptotically flat for r — >■ cxi must contain a light 
sphere at some radius between th and 00 (see Hasse and Per lick [191] ). In the same article, it is 
shown that in any spherically symmetric static spacetime with a light sphere there is gravitational 
lensing with infinitely many images. Bozza [45] investigated a strong-field limit of lensing in spher- 
ically symmetric static spacetimes, as opposed to the well-known weak-field limit, which applies 
to light rays that come close to an unstable light sphere. (In later papers, the term "strong-field 
limit" was replaced with "strong-deflection limit" . This is, indeed, more appropriate because the 
gravitational field, measured in terms of tidal forces, need not be particularly strong near an unsta- 
ble light sphere. The characteristic feature is that the bending angle goes to infinity, i.e., that light 
rays make arbitrarily many turns around the center if they approach an unstable light sphere.) 
This limit applies, in particular, to light rays that approach the sphere r — 3m in the Schwarzschild 
spacetime (see [53] and, for illustrations. Figures [161 [T71 and ITS)). The strong-deflection limit has 
also been applied to many other spherically symmetric and static metrics; several examples are 
discussed in Section [S] below. As demonstrated in the original article by Bozza [15], the parameters 
that characterize the strong-defiection limit can be used to distinguish between different black-hole 
metrics. These parameters were related to quasi-normal modes in [386] . 



Index of refraction and embedding diagrams. 

Transformation to an isotropic radius coordinate r via 

Sir)dr _ dr 

R{r) ~ f ^' ' 

takes the Fermat metric ([70]) to the form 

g = n(f)^ {df^ + r^{di}^ + sin^ ^9dip^)) (73) 

where 

n{f) = (74) 
r 

On the right-hand side r has to be expressed by r with the help of Equation (|72l) . The results 
of Section |4?2] implv that the lightlike geodesies in a spherically symmetric static spacetime are 
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equivalent to the light rays in a medium with index of refraction ([74]) on Euclidean 3-space. For 
arbitrary metrics of the form this result is due to Atkinson [16 . It reduces the lightlike 

geodesic problem in a spherically symmetric static spacetime to a standard problem in ordinary 
optics, as treated, e.g., in [266], §27, and [252) . Section 4. One can combine this result with 
our earlier observation that the integral in Equation (|67|) has the same form as the functional 
in Maupertuis' principle in classical mechanics. This demonstrates that light rays in spherically 
symmetric and static spacetimes behave like particles in a spherically symmetric potential on 
Euclidean 3-space (cf., e.g., |135| ). 
If the embeddability condition 

S{rf > R\rf (75) 

is satisfied, we define a function Z{r) by 



Z\r) = ^S{r)^ - R'{r)^. (76) 

Then the Fermat metric (170)) reads 

g = {dR{r)f + R{rf {dd^ + sin^ d V) + {dZ{r)f . (77) 

If restricted to the equatorial plane d — 7r/2, the metric (|77p describes a surface of revolution, 
embedded into Euclidean 3-space as 

(r, if) {R{r) cos R{r) sin ip, Z{r)) . (78) 

Such embeddings of the Fermat geometry have been visualized for several spacetimes of interest 
(see Figure [12] for the Schwarzschild case and |201l 1202] for other examples) . This is quite in- 
structive because from a picture of a surface of revolution one can read the qualitative features 
of its geodesies without calculating them. Note that Equation ([7^ defines the isotropic radius 
coordinate uniquely up to a multiplicative constant. Hence, the straight lines in this coordinate 
representation give us an unambiguously defined reference grid for every spherically symmetric and 
static spacetime. These straight lines have been called triangulation lines in [84] [85], where their 
use for calculating bending angles, exactly or approximately, is outlined. 

Light cone. 

In a spherically symmetric static spacetime, the (past) light cone of an event po can be written in 
terms of integrals over the metric coefficients. We first restrict to the equatorial plane d = tt/2. 
The 5-geodesics are then determined by the Lagrangian 



The Euler-Lagrange equations read 



2 / , X 2 



(SO, 

«(.f*)=0. (81) 

After dividing the first equation by R{r)'^ {dip / di)"^ , and using the second equation, we find 

d ( S{rf dr \ _ S{r)S'{r) fdrV , R'{r) 



dip \R{r)'^ dip J R{rY \dip J R{r) 



(82) 
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Equations (|80l) and (|8T|) give the light rays parametrized by g-arclength (which equals travel time) 
£, Equation (|82p can be used for determining the orbits of light rays if the parametrization plays 
no role. 

For fixed radius value ro, initial conditions 

dr cose 



(84) 



^(o)=o, 5(0)=!;^ 

determine a unique solution r = r{£,Q), ip — of the Euler-Lagrange equations (|80|) and 

(|8ip. O measures the initial direction with respect to the symmetry axis (see Figure |6]). We get all 
light rays issuing from the event r — ro, ~ Q, "d — 7r/2, t — to into the past by letting Q range 
from to TT and applying rotations around the symmetry axis. This gives us the past light cone of 
this event in the form 

/ to-i \ 

r{e,e) sin 0(^,9) cos^- 
r(£,e) sin0(£,e) sin^- 
V r(£,e) cos0(£,e) J 

5* and O are spherical coordinates on the observer's sky. If we let to float over M, we get the 
observational coordinates ^ for an observer on a t-line, up to two modifications. First, to is not 
the same as proper time r; however, along each i-line they are related just by a constant, 

^ = e-f^^o\ (85) 
dto 

Second, £ is not the same as the affine parameter s; along a ray with initial direction Q, they are 
related by 



The constants of motion 



2 / , \ 2 



Rirr^=Riro)sine, .(.)M- + BirY ] =1 (87) 



jive us the functions r{£, Q), (j){£, 8) in terms of integrals, 

r-r{Le) R(r)S{r)dr 



R{rY - R{roY sin^ 8 



(88) 



(b{£,e) ^ R{r o) sin Q ^ ^ (89) 

•^'■o - R{r) ^ R{ry - R{r o^ sin"^ e 

Here the notation with the dots is a short-hand; it means that the integral is to be decomposed into 
sections where r{£, 8) is a monotonous function of ^, and that the absolute value of the integrals over 
all sections have to be added up. Turning points occur at radius values where R{r) = R{ro) sin 8 
and R'{r) ^ (see Figure fTO|l . If the metric coefficients S and R have been specified, these integrals 
can be calculated and give us the light cone (see Figure [13] for an example). Having parametrized 
the rays with g-arclength (= travel time), we immediately get the intersections of the light cone 
with hypersurfaces t = constant ( "instantaneous wave fronts" ) ; see Figures [T21 and [^Hl 
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Exact lens map and various approximation methods. 



Recall from Section 12.11 that the exact lens map |154j refers to a chosen observation event po 
and a chosen "source surface" T. In general, for T we may choose any 3-dimensional submanifold 
that is ruled by timelike curves. The latter are to be interpreted as wordlines of light sources. In a 
spherically symmetric and static spacetime, we may take advantage of the symmetry by choosing 
for T a sphere r = rg with its ruling by the i-lines. This restricts the consideration to lensing for 
static light sources. Note that, for an observer at ro, all static light sources at radius rg undergo 
the same redshift, log(l + z) = f{rs) — fifo)- Without loss of generality, we place the observation 
event po on the 3-axis. This gives us the past light cone in the representation ([84|) . To each ray 
from the observer, with initial direction characterized by O, we can assign the total angle $(9) 
the ray sweeps out on its way from ro to rs (see Figure[6]). $(9) is given by Equation (|89|) . 

<i>(9) = i?(ro) sin9 / , ^ ' , (90) 

•^'o - R{r)^R{rf - R{rof sin^ 9 

where the same short-hand notation is used as in Equation (|89p . $(9) is not necessarily defined for 
all 9 because some light rays that start at ro may not reach rs. Also, $(9) may be multi- valued 
because a light ray may intersect the sphere r = rs several times. Equation (IM)) gives us the 
(possibly multi-valued) lens map 

(rs sin $(9) cos^'X 
rs sin $(9) sin* . (91) 
rs cos $(9) / 

This version of the exact lens map in spherically symmetric and static spacetimes was first con- 
sidered in [339] . It is interesting to compare it with the standard lens map (or lens equation) in 
the quasi-Newtonian approximation formalism, see e.g. Wambsganss |427| . Section 3.1. In both 
cases, rotational symmetry about the axis through the observer has the effect that in essence the 
lens map reduces to a map from an angle to another angle; the first angle, here 9, determines the 
position of the image on the observer's sky, the second angle, here $(9), gives the actual position 
of the light source. If the metric coefficients R(r) and S(r) are given, the integrals in Equation 
(|90|) can be numerically calculated and from the result all lensing features can be determined with 
arbitrary accuracy. As an example, the exact lens map will be evaluated for the Schwarzschild 
metric in Section [01 below. In [339) ). the examples of an Ellis wormhole (cf. Section [5^ and of a 
Barriola-Vilenkin monopole (cf. Section [5T5|) were treated. - Note that $(9) may take any value 
between and infinity. A value $(9) > 27r occurs whenever a light ray makes more than one full 
turn around the center. For each image we can define the order 

i(9) = min{TO e N| $(9) < m7r} , (92) 

which counts how often the ray has crossed the axis. (If the lens map is multi-valued, one should 
introduce an index to label different images that correspond to the same angle 9). In accordance 
with the terminology introduced in Section 13. 1[ an image of order is called primary, an image of 
order 1 is called secondary, and so on. The standard example where images of arbitrarily high order 
occur is the Schwarzschild spacetime (see Section l5.ip . For a light source which is not perfectly 
aligned with the observer and the center, images of even order have even parity and line up on 
one side of the direction towards the center; images of odd order have odd parity and line up on 
the other side of the direction towards the center. In the case of perfect alignment, a sequence of 
Einstein rings is seen. An Einstein ring of order is called primary, an Einstein ring of order 1 is 
called secondary, and so on. - We can rewrite the exact lens map in a spherically symmetric and 
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static spacetime in a form more similar to the standard quasi-Newtonian lens map if we make two 
additional assumptions which are satisfied in many, though not all, situations of interest: 



• The spacetime is asymptotically flat and both ro and rg are very large. 

• The source is almost exactly opposite to the observer, i.e., $(0) is close to an odd multiple 

of TT. 

The first assumption makes sure that the lens map is single- valued, and both assumptions together 
imply that along each (past-oriented) light ray from the observer to the source the radius coordinate 
has precisely one turning-point. For a light ray with turning point at r,„(0), the asymptotic 
assumption allows to approximate Equation ([UO]) by 

$(6) =2/°° S^}^^_^ ^^3^ 



R{r)^ R{rY - R{roY sin^ 6 



To link up with the notation of the standard lens map, we introduce distances Dd and Dds and 
angles — tt — 9, j3{9) and a{9) according to Figure [T) The alignment assumption implies that 
j3[d) is small, and the asymptotic condition implies that the bending angle a can be approximated 
as 

a{e) = $(6) - TT (94) 
After some elementary geometry, one finds that 

tan/3(6') = tanfl- -^^-^^^ tan 6* -I- tan (a (6*) - 6*)^ (95) 

This is the lens equation of Virbhadra and Ellis |420] (cf. |422] for an earlier version). Equation 
([^ gives a well-defined (single- valued) lens map 9 ^ (3 (9) if we insert Equations ([M]) and ([Mt . 
The Virbhadra-EUis lens map may be called "almost exact" . It is based on approximations as to 
the positions of source and observer, but it is not restricted to the case that the bending angle is 
small. As a matter of fact, the bending angle may be arbitrarily large; a{9) diverges to infinity if 
the turning point ^^(0) approaches an unstable light sphere. (It was already mentioned that an 
unstable light sphere occurs at a radius value rp if and only if R'{rp) = and R"{rp) > 0; the 
standard example is the sphere at Vp = 3m in the Schwarzschild spacetime, see Section [5.ip . It was 
shown by Bozza [4'8j that, whenever an unstable light sphere is approached, the divergence of the 
bending angle is logarithmic. The Virbhadra-Ellis lens equation was originally introduced for the 
Schwarzschild metric [420] where it approximates the exact treatment remarkably well within a 
wide range of validity |150J . In comparison to the exact lens map, the Virbhadra-Ellis lens map has 
the appealing property of resembling the standard quasi-Newtonian lens map as much as possible. 
On the other hand, neither analytical nor numerical evaluation of the "almost exact lens map" is 
significantly easier than that of the exact lens map. The Virbhadra-Ellis lens map was successfully 
applied to many spherically symmetric and static spacetimes, several examples are considered in 
Section [5] below. Bozza |50| compared the Virbhadra-Ellis lens equation with other approximate 
lens equations that had been proposed for spherically symmetric and static spacetimes and, in 
particular, for the Schwarzschild metric: 

• the Ohanian lens equation, which was implicitly contained in Ohanian's pioneering work [314] 
on Schwarzschild lensing, 



a modification of the Virbhadra-Ellis lens equation, introduced by Daljrowski and Schunck | 
in their treatment of lensing by a boson star. 
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• a lens equation introduced by Bozza and Sereno j58j that is essentially equivalent to the 
Ohanian lens equation but replaces an angle centered at the lens by an angle centered at the 
observer, 

• a new lens equation that is, again, a slight modification of the Ohanian lens equation. 

All these lens equations relax the alignment condition but retain some kind of asymptotic assump- 
tion. After discussing the accuracy of these various lens equations in realistic situations, Bozza 
argues in favour of the Ohanian lens equation and its modifications. - In addition to approxi- 
mative lens equations, several other approximation techniques have been developed for lensing in 
spherically symmetric and static spacetimes. Amore and Arceo [S] expressed the bending angle 
analytically as a rapidly convergent series; this approach was further developed in [10l[TT]. Keeton 
and Fetters [230] expanded corrections to the weak-deflection limit as a Taylor series in the gravi- 
tational radius of the lens. In two follow-up papers, they applied this formalism to post-Newtonian 
metrics [231^ and to braneworld black holes [232] • A major purpose of all approximation methods 
mentioned is to test general relativity by comparing Scliwarzschild lensing to lensing in alternative 
theories of gravity, see Section 15.11 




Figure 6: Illustration of the exact lens map in spherically symmetric static spacetimes. The picture 
shows a spatial plane. The observation event (dot) is at r = tq, static light sources are distributed 
at r = rs. 8 is the colatitude coordinate on the observer's sky. It takes values between and tt. 
$(0) is the angle swept out by the ray with initial direction O on its way from ro to rs. It takes 
values between and oo. In general, neither existence nor uniqueness of $(0) is guaranteed for 
given Q. A similar picture is in [339]. 

Distance measures, image distortion and brightness of images. 

For calculating image distortion (see Section [23]) and the brightness of images (see Section [23| we 
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Figure 7: Illustration of the "almost exact" lens map of Virbhadra and Ellis [420] . As almost 
perfect alignment is assumed, we may think of the light sources as being distributed on a plane, 
rather than on a sphere as for the exact lens map. and D^s are measured in terms of the radial 
coordinate. The difference between and 9 — n — Q must be observed when comparing with the 
exact lens map. I3(9) is the angle between the axis and a straight line, in the coordinate picture, 
that connects observer and source. In contrast to the angle ^(O), the angle /3(6') is not invariant 
under a transformation of the radial coordinate; neither $(9) nor /3{9) is observable. Note that 
the Virbhadra- Ellis lens map also applies to cases where $(0) is bigger than 2n. 
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have to consider infinitesimally thin bundles with vertex at the observer. In a spherically symmetric 
and static spacetime, we can apply the orthonormal derivative operators Oq and sin 9 9* to the 
representation (|84l) of the past light cone. Along each ray, this gives us two Jacobi fields Yi and 
Y2 which span an infinitesimally thin bundle with vertex at the observer. Yi points in the radial 
direction and Y2 points in the tangential direction (see Figure [8|). The radial and the tangential 
direction are orthogonal to each other and, by symmetry, parallel-transported along each ray. Thus, 
we can choose the Sachs basis (E'l, E2) such that Yi = D+Ei and I2 = D^E2- The coefficients D+ 
and I?_ are unique if we require them to be positive near the vertex. and D_ are the extremal 
angular diameter distances of Section \TM with respect to a static observer (because the (^E", 8)-grid 
refers to a static observer). In the case at hand, they are called the radial and tangential angular 
diameter distances. They can be calculated by normalizing Yi and Y2, 

D+(^,e) = e-^(^(^'®»i?(ro) cose ^Rir{£,e)f - Rirl) sin^ 6 



y/i?(r)2 - i?(ro)2 sin^e' 



D_(£,e) = e/(^(^'«»i?(r(£,e))5^^i^i^. 

smB 



(97) 



These formulas have been derived first for the special case of the Schwarzschild metric by Dwivedi 
and Kantowski |110) and then for arbitrary spherically symmetric static spacetimes by Dyer [111] . 
(In [111) . Equation (1^7]) is erroneously given only for the case that, in our notation, ef'^^'^R{r) = 
r.) From these formulas we immediately get the area distance -Daroa = \/\D+D^ \ for a static 
observer and, with the help of the redshift z, the luminosity distance -Dium = (1 + •z)^^arca (recall 
Section 12. 4p . In this way. Equation (|96|) and Equation (|97|) allow to calculate the brightness of 
images according to the formulas of Section [2?6l Similarly, Equation (|96p and Equation ([97|) allow 
to calculate image distortion in terms of the ellipticity e (recall Section In general, e is a 

complex quantity, defined by Equation (j49p . In the case at hand, it reduces to the real quantity 
e — Z?_ /-D+ — -D+ /-D- . The expansion 9 and the shear a of the bundles under consideration can 
be calculated from Kantowski's formula |222[ IllOj . 

b± = {e±a)D±, (98) 

to which Equation (|27p reduces in the case at hand. The dot {— derivative with respect to the 
affine parameter s) is related to the derivative with respect to t by Equation (|86p . Evaluating 
Equations (j96l I97|) in connection with the exact lens map leads to quite convenient formulas, for 
static light sources at r = r^. Setting r(^, 6) = rs and 0(^,6) — $(0) and comparing with 
Equation ^ yields (cf. [339 J 



D+{Q) = eJ^''^^ yR{rsy - RijoY sin^ 9 $'(6), (99) 
D_(e) = e-''^''") i?(rs) sin$(e). (100) 

These formulas immediately give image distortion and the brightness of images if the map 9 1— >■ 
$(9) is known. 



Caustics of light cones. 

Quite generally, the past light cone has a caustic point exactly where at least one of the extremal 
angular diameter distances Z3_ vanishes (see Sections 12.21 12. 3[ and l2.4p . In the case at hand, 
zeros of I?+ arc called radial caustic points and zeros of D- are called tangential caustic points 
(see Figure [9]). By Equation ([97)) . tangential caustic points occur if (/)(£, 9) is a multiple of tt, i.e.. 
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Figure 8: Thin bundle around a ray in a spherically symmetric static spacetime. The picture 
is purely spatial, i.e., the time coordinate t is not shown. The ray is contained in a plane, so 
there are two distinguished spatial directions orthogonal to the ray: the "radial" direction (in the 
plane) and the "tangential" direction (orthogonal to the plane). For a bundle with vertex at the 
observer, the radial diameter of the cross-section equals 2|Z?_(.|, and the tangential diameter of the 
cross-section equals 2|D_|. Up to the first caustic point, and Z)_ arc positive. In contrast to 
the general situation of Figure [3l here the angle x is always zero (if the Sachs basis {Ei,E2) is 
chosen appropriately). 

whenever a light ray crosses the axis of symmetry through the observer (see Figure [5]). Symmetry 
implies that a point source is seen as a ring ( "Einstein ring" ) if its worldline crosses a tangential 
caustic point. By contrast, a point source whose wordline crosses a radial caustic point is seen 
infinitesimally extended in the radial direction. The set of all tangential caustic points of the past 
light cone is called the tangential caustic for short. In general, it has several connected components. 
In accordance with the order of images, as defined in Equation (j92l) . these connected components 
can be labeled as primary, secondary, etc. tangential caustics. Each connected component is 
a spacelike curve in spacetime which projects to (part of) the axis of symmetry through the 
observer. The radial caustic is a lightlike surface in spacetime unless at points where it meets the 
axis; its projection to space is rotationally symmetric around the axis. The best known example 
for a tangential caustic, with infinitely many connected components, occurs in the Schwarzschild 
spacetime (see Figure [T5|). It is also instructive to visualize radial and tangential caustics in terms 
of instantaneous wave fronts, i.e., intersections of the light cone with hypersurfaces t — constant. 
Examples are shown in Figures [HI [HI and [20l By symmetry, a tangential caustic point of an 
instantaneous wave front can be neither a cusp nor a swallow-tail. Hence, the general result of 
Section [22] implies that the tangential caustic is always unstable. The radial caustic in Figure [20] 
consists of cusps and is, thus, stable. 

4.4 Lensing in axisymmetric stationary spacetimes 

Axisymmetric stationary spacetimes are of interest in view of lensing as general-relativistic models 
for rotating deflectors. The best known and most important example is the Kerr metric which 
describes a rotating black hole (see Section [575]) . For non-collapsed rotating objects, exact solu- 
tions of Einstein's field equation are known only for the idealized cases of infinitely long cylinders 
(including string models; see Section [5. lOp and disks (see Section [HH]). Here we collect, as a prepa- 
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Figure 9: Tangential and radial caustic points. Tangential caustic points, D- = 0, occur on the 
axis of symmetry through the observer. A (point) source at a tangential caustic point is seen as 
a (1-dimensional) Einstein ring on the observer's sky. A point source at a radial caustic point, 
I?+ = 0, appears "infinitesimally extended" in the radial direction. 



ration for these examples, some formulas for an unspecified axisymmetric stationary metric. The 
latter can be written in coordinates {y^ , , if, t), with capital indices A, B, . . . taking the values 1 
and 2, as 

5 = 9tt{y) dt^ + 2gt^{y) dtdip + g^^{y) dip^ + gAsiv) dy^ dy^ , (101) 

where all metric coefficients depend on y — {y^,y'^) only. We assume that the integral curves of 
dip are closed, with the usual (27r)-periodicity, and that the 2-dimensional orbits spanned by 
and dt are timelike. Then the Lorentzian signature of g implies that gAsiy) is positive definite. 
In general, the vector field dt need not be timelike and the hypersurfaces t = constant need not be 
spacelike. Our assumptions allow for transformations {ip, t) t-^ (tp + Qt, t) with a constant il. If, 
by such a transformation, we can achieve that gtt < everywhere, we can use the purely spatial 
formalism for light rays in terms of the Fermat geometry (recall Section 14. 2p . Comparison of 
Equation (jlOip with Equation (|6ip shows that the redshift potential /, the Fermat metric g, and 
the Fermat one-form (j) are 

e'^ = -gtt, (102) 

g = .9AB ^^B _^ d^2^ (^03) 

9tt gtt 
^ = -il^dip, (104) 

gtt 

respectively. If it is not possible to make gtt negative on the entire spacetime domain under 
consideration, the Fermat geometry is defined only locally and, therefore, of limited usefulness. 
This is the case, e.g., for the Kerr metric where, in Boyer-Lindquist coordinates, gtt is positive in 
the ergosphere (see Section [5^ . 

Variational techniques related to Fermat 's principal in stationary spacetimes are detailed in a 
book by Masiello |272| . Note that, in contrast to standard terminology, Masiello's definition of 
stationarity includes the assumption that the surfaces t = constant are spacelike. 
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For a rotating body with an equatorial plane (i.e., with reflectional symmetry), the Fermat 
metric of the equatorial plane can be represented by an embedding diagram, in analogy to the 
spherically symmetric static case (recall Figure [T^ . However, one should keep in mind that in the 
non-static case the lightlike geodesies do not correspond to the geodesies of g but are affected, in 
addition, by a sort of Coriolis force produced by cf). For a review on embedding diagrams, including 
several examples (see }201] ). 
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5 Examples 



5.1 Schwarzschild spacetime 

The (exterior) Schwarzschild metric 



— )dt^+(l-—) ' dr^ + {d^^ + sin^ ^ V) (105) 



r / \ r 



has the form ([55]) with 



e^nr) ^S{r)-' R{r)= , =. (106) 

r 2m 



It is the unique spherically symmetric vacuum solution of Einstein's field equation. At the same 
time, it is the most important and best understood spacetime in which lensing can be explicitly 
studied without approximations. Schwarzschild lensing beyond the weak-field approximation has 
astrophysical relevance in view of black holes and neutron stars. The increasing evidence that 
there is a supermassive black hole at the center of our Galaxy (see [136] for background material) 
is a major motivation for a detailed study of Schwarzschild lensing (and of Kerr lensing; see Sec- 
tion 15. 8p . In the following we consider the Schwarzschild metric with a constant m > and we 
ignore the region r < for which the singularity at r = is naked. The Schwarzschild metric is 
static on the region 2m < r < oo. (The region r < for m > is equivalent to the region r > 
for 771 < 0. It is usually considered as unphysical but has found some recent interest in connection 
with lensing by wormholes; see Section [5. 41 ) 



Historical notes. 

Shortly after the discovery of the Schwarzschild metric by Schwarzschild [372] and independently 
by Droste [106] , basic features of its lightlike geodesies were found by Flamm [144' , Hilbert [198] , 
and Weyl [430] . Detailed studies of its timelike and lightlike geodesies were made by Hagihara |183j 
and Darwin [951 196]. For a fairly complete list of the pre-1979 literature on Schwarzschild geodesies 
see Sharp [380] . All modern text-books on general relativity include a section on Schwarzschild 
geodesies, but not all of them go beyond the weak-field approximation. For a particularly detailed 
exposition see Chandrasekhar [75] . 

Redshift and Fermat geometry. 

The redshift potential / for the Schwarzschild metric is given in Equation (I106p . With the help 
of / we can directly calculate the redshift via Equation (|55)) if observer and light source are static 
(i.e., lines). If the light source or the observer does not follow a t-\me, a Doppler factor has to be 
added. Independent of the velocity of observer and light source, the redshift becomes arbitrarily 
large if the light source is sufiiciently close to the horizon. For light source and observer freely 
falling, the redshift formula was discussed by Bazahski and Jaranowski 30 . If projected to 3-space, 
the light rays in the Schwarzschild spacetime are the geodesies of the Fermat metric which can be 
read from Equation ([70]) (cf. Frankel [147] ). 

dr^ r'^id-d'^ + sin-d dw"^) 

' ^ -. (107) 



^ 2m ^2 2 2m 

The metric coefficient R{r), as given by Equation (|106|) . has a strict minimum at r = 3m and 
no other extrema (see Figure [10]). Hence, there is an unstable light sphere at this radius (recall 
Equation (|7T])). The existence of circular light rays at r = 3m was noted already by Hilbert [198] . 
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The relevance of these circular light rays in view of lensing was clearly seen by Darwin [95l l96] 
and Atkinson [TB]. They realized, in particular, that a Schwarzschild black hole produces infinitely 
many images of each light source, corresponding to an infinite sequence of light rays whose limit 
curve asymptotically spirals towards a circular light ray. The circular light rays at r = 3m are also 
associated with other physical effects such as centrifugal force reversal and "locking" of gyroscopes. 
These effects have been discussed with the help of the Fermat geometry (= optical reference geom- 
etry) in various articles by Abramowicz and collaborators (see, e.g., IH [6l [2] ) . 




Figure 10: The function R{r) for the Schwarzschild metric. Light rays that start at ro with 
initial direction 6 are confined to the region where R{r) > R{ro) sin 9. The equation i?(3m) = 
R{ro) sin (5 defines for each ro a critical value 5. A light ray from ro with Q = S asymptotically 
approaches r — 3m. 

Index of refraction and embedding diagrams. 

We know from Section |4]3] that light rays in any spherically symmetric and static spacetime can be 
characterized by an index of refraction. This requires introducing an isotropic radius coordinate f 
via Equation ([72|) . In the Schwarzschild case, f is related to the Schwarzschild radius coordinate r 

by 

f =- (\/r^ - 2mr + r - mV ^ ^ (2r + m) ^ 
2 V / 4r 

f ranges from m/2 to infinity if r ranges from 2m to infinity. In terms of the isotropic coordinate, 
the Fermat metric (|107p takes the form 

g = n{r f {dr^ + {d^^ + sin^ i} d(p^)) (109) 

with 

• (110) 
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Hence, light propagation in the Schwarzschild metric can be mimicked by the index of refrac- 
tion (|110p : see Figure [TT] The index of refraction (|110l) is known since Weyl |432| . It was employed 
for calculating lightlike Schwarzschild geodesies, exactly or approximately, e.g., in [TCI 12961 1134[ 
1258) . This index of refraction can be modeled by a fluid flow [358'. The embeddability condi- 
tion (|75|) is satisfied for r > 2.25m (which coincides with the so-called Buchdahl limit). On this 
domain the Fermat geometry, if restricted to the equatorial plane 'd = 7r/2, can be represented 
as a surface of revolution in Euclidean 3-space (see Figure fT2|) . The entire region r > 2m can be 
isometrically embedded into a space of constant negative curvature [3]- 



n(f) 



1 




m/2 m 

Figure 11: Index of refraction n(r), given by Equation (jllOl) . for the Schwarzschild metric as a 
function of the isotropic coordinate r. 

Lensing by a Schwarzschild black hole. 

To get a Schwarzschild black hole, one joins at r = 2m the static Schwarzschild region 2m < ?■ < oo 
to the non-static Schwarzschild region < r < 2m in such a way that ingoing light rays can 
cross this surface but outgoing cannot. If the observation event po is at ro > 2m, only the 
region r > 2m is of relevance for lensing, because the past light cone of such an event does 
not intersect the black- hole horizon at r = 2m. (For a Schwarzschild white hole see below.) 
Such a light cone is depicted in Figure [13] (cf. [236J ). The picture was produced with the help 
of the representation (|84|) which requires integrating Equation (|88|1 and Equation ([89|l . For the 
Schwarzschild case, these are elliptical integrals. Their numerical evaluation is an exercise for 
students (see for a MATHEMATICA program). Note that the evaluation of Equation ((HH|) 
and Equation (|89)) requires knowledge of the turning points. In the Schwarzschild case, there is at 
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r = 2.25m 



Figure 12: Fermat geometry of the equatorial plane of the Schwarzschild spacetime, embedded as a 
surface of revolution into Euchdean 3-space. The neck is at r = 3m (i.e., f w 1.87m), the boundary 
of the embeddable part at r = 2.25m (i.e., f = m). The geodesies of this surface of revolution give 
the light rays in the Schwarzschild spacetime. A similar figure can be found in 4 (also cf. f201'). 

most one turning point rm{Q) along each ray (see Figure [TU|). and it is given by the cubic equation 



The representation (|84p in terms of Fermat arclength £ (= travel time) gives us the intersections of 
the light cone with hypersurfaces t = constant. These "instantaneous wave fronts" are depicted in 
Figure [14] (cf. |184j ). With the light cone explicitly known, one can analytically verify that every 
inextendible timelike curve in the region r > 2m intersects the light cone infinitely many times, 
provided it is bounded away from the horizon and from (past lightlike) infinity. This shows that 
the observer sees infinitely many images of a light source with this worldline. The same result can 
be proven with the help of Morse theory (see Section 13. 3|) , where one has to exclude the case that 
the worldline meets the caustic of the light cone. In the latter case the light source is seen as an 
Einstein ring. Note that a moving source might appear simultaneously as a point image and as an 
Einstein ring on the observer's sky. For static light sources (i.e., i-lines), however, either all images 
are Einstein rings or none. For such light sources we can study lensing in the exact-lens-map 
formulation of Section 143] fsee in particular Figure [6]). Also, Section [43] provides us with formulas 
for distance measures, brightness, and image distortion which we just have to specialize to the 
Schwarzschild case. For another treatment of Schwarzschild lensing with the help of the exact lens 
map, see jl50j . We place our static light sources at radius rg. If ro < rg and 3m < rs, only light 
rays with Q < 5, 



can reach the radius value rg (see Figure [TU]). Rays with Q — S asymptotically spiral towards the 
light sphere at r = 3m. 5 lies between and 7r/2 for tq < 3m and between 7r/2 and tt for tq > 3m. 
The escape cone defined by Equation (|112[) is depicted, for different values of tq, in Figure [T5l It 
gives the domain of definition for the lens map. The lens map is graphically discussed in Figure [TO] 
The pictures are valid for ro — 5m and rg — 10m. Qualitatively, however, they look the same for 
all cases with rg > ro and rg > 3m. From the diagram one can read the position of the infinitely 
many images for each light source which, for the two light sources on the axis, degenerate into 
infinitely many Einstein rings. For each fixed source, the images are ordered by the number i 



r-m(e)^ (ro - 2m) - r„(e) r^ sin^ 6 + 2mr^ sin^ 6 = 0. 



(Ill) 




(112) 
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(=0,1,2,3,...) which counts how often the ray has crossed the axis. This coincides with ordering 
according to travel time. With increasing order i, the images come closer and closer to the rim at 
Q = S (see Figure [T5|) and their brightness decreases rapidly (see Figure [T5)) . For a light source 
not on the axis, images of even order are upright and line up on one side of the direction towards 
the center, images of odd order are side-inverted (see Figure I17p and line up on the other side 
of the direction towards the center (see Figure [T6|) . These basic features of Schwarzschild lensing 
are known since pioneering papers by Darwin [95j and Atkinson [16] (cf. [2651 1314] 1254) ). Various 
methods of how multiple imaging by a black hole could be discovered, directly or indirectly, have 
been discussed |265l 12541 [211 [20l 13421 199] . Related work has also been done for Kerr black holes (see 
Section [578]) . An interesting suggestion was made in j204j . A Schwarzschild black hole, somewhere 
in the universe, would send photons originating from our Sun back to the vicinity of our Sun 
("boomerang photons" |394j ) . If the black hole is sufficiently close to our Solar system, this would 
produce images of our own Sun on the sky that could be detectable. Quite generally, one speaks 
of retrolensing when a gravitating mass sends light back into approximately the same direction 
from which it has come in. Retrolensed images have not been observed so far, the perspectives are 
discussed, e.g., in [99| 1125] . - The lensing effect of a Schwarzschild black hole has been visualized 
in two ways: 

1. by showing the visual appearance of some background pattern as distorted by the black 
hole [53 IMS! IMl [m IMO], also cf. [Ml 1^ . 

2. by showing the visual appearence of an accretion disk around the black hole [265|I161]I21II20] . 
also cf. [SUIMIIST]. 

In the course of time the ray tracing programs on which these visualizations are based have become 
more and more advanced, taking not only redshift and magnification (including higher-order im- 
ages) but also Fraunhofer diffraction (due to the finite aperture of the observer's eye) or scattering 
into account. Ray tracing programs have also been developed for the more general case of the Kerr 
metric, see Section 15.81 - Interest in Schwarzschild lensing (and Kerr lensing) beyond the weak- 
field approximation has greatly increased with the growing evidence that there is a supermassive 
black hole at the center of our galaxy, and probably at the center of most galaxies. Higher-order 
images, where a light ray makes at least one full turn around the center, have not been observed 
so far, but they are thought to be relevant for future observations. It was already emphasized 
that, even if the bending angles are arbitrarily large, all lensing properties of a Schwarzschild black 
hole can be calculated exactly, in terms of elliptic integrals; then these integrals can be evaluated 
numerically with arbitrary accuracy and the results can be discussed graphically, as exemplified 
in Figures I16[ 1171 and 1181 However, for practical purposes many authors found it convenient to 
develop approximation methods that go beyond, or are complementary to, the weak-field approx- 
imation, rather than to work with the exact formulas. Two approximation methods have proven 
particularly useful: Virbhadra and Ellis [42^ developed a lens equation that applies to the case 
that source and observer are in the asymptotic region and approximately aligned with the center, 
but is not restricted to light rays that remain in the asymptotic region. Bozza et al. [53] |48] 
introduced a strong-field limit (or strong-defiection limit) that describes light rays the better the 
more turns they make around the center. Both methods, along with other approximation tech- 
niques [9] [ini dH I230| I23H 1232] for light bending in spherically-symmetric and static spacetimes, 
have already been discussed in Section 14.31 above. Especially for the Schwarzschild metric, Iyer 
and Fetters |215j have demonstrated that, by combining a strong-deflection series expansion and 
a weak-deflection series expansion, one gets an approximation that is within 1 % of the exact 
bending angle value for light rays traversing anywhere between the photon sphere and infinity. 
A main goal of all these endeavours is to provide a new test of general relativity with the help 
of higher-order images, once they have been observed. As shown by Bozza [42], the separation 
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of higher-order images and their decrease in magnitude can be used for discriminating between 
different black holes. Hence, the observation of higher-order images would reveal if the bending 
object can be modeled as a Schwarzschild black hole, or if an alternative model has to be used. 
An example for such an alternative model is the Reissner-Nordstrom black hole (see Section [573)) . 
Other spherically-symmetric and static black hole models have found some interest because their 
existence is predicted by alternative theories of gravity. E.g., the bending properties have been 
worked out for black holes from string theory [39] , from braneworld gravity j224| I433| \122\ 11231 HI] i 
from Einstein-Born-Infeld theory !124;, from dilaton theory |286l 1168] and from Hofava-Lifshitz 
gravity [76]. Up to now there is no observational indication that any of these black holes exist in 
nature. The future observation of higher-order images could help to find out if they exist. For the 
time being, all observations are in agreement with the assumption that the existing black holes 
are Schwarzschild or Kerr black holes, as predicted by standard general relativity. Schwarzschild 
lensing as a tool for probing the supermassive objects at the center of galaxies is discussed in detail 
by Virbhadra [419J . For a recent review on black hole lensing in general see Bozza [52_, . 
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Figure 13: Past light cone in the Schwarzschild spacetime. One sees that the light cone wraps 
around the horizon, then forms a tangential caustic. In the picture the caustic looks like a transverse 
self-intersection because one spatial dimension is suppressed. (Only the hyperplane = 7r/2 is 
shown.) There is no radial caustic. If one follows the light rays further back in time, the light cone 
wraps around the horizon again and again, thereby forming infinitely many tangential caustics 
which alternately cover the radius line through the observer and the radius line opposite to the 
observer. In spacetime, each caustic is a spacelike curve along which r ranges from 2m to oo, 
whereas t ranges from — cx) to some maximal value and then back to — oo. Equal-time sections of 
this light cone are shown in Figure [14] 
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Figure 14: Instantaneous wave fronts of the light cone in the Schwarzschild spacetime. This picture 
shows intersections of the light cone in Figure [T5] with hypersurfaces t = constant for four i- values, 
with ti > t2 > > t^. The instantaneous wave fronts wrap around the horizon and, after reaching 
the first caustic, have two caustic points each. If one goes further back in time than shown in the 
picture, the wave fronts another time wrap around the horizon, reach the second caustic, and now 
have four caustic points each, and so on. In comparison to Figure [TSj the representation in terms 
of instantaneous wave fronts has the advantage that all three spatial dimensions are shown. 



61 




Figure 15: Escape cones in the Schwarzschild metric, for five values of ro- For an observer at 
radius ro, light sources distributed at a radius rs with rs > ro and rg > 3m illuminate a disk 
whose angular radius d is given by Equation (|112p . The boundary of this disk corresponds to light 
rays that spiral towards the light sphere at r = 3m. If ro is big, the bright disk covers almost the 
whole sky, leaving a small dark disk that is called the "shadow" of the black hole. With decreasing 
ro, the shadow becomes bigger and bigger until, for ro — > 2m, it covers the whole sky. Figure [TU] 
illustrates that the notion of escape cones is meaningful for any spherically symmetric and static 
spacetime where R has one minimum and no other extrema [321) . For the Schwarzschild spacetime, 
the escape cones were first mentioned in [3171 1277] , and explicitly calculated in |398] . A picture 
similar to this one can be found, e.g., in [75] . p. 130. 



Lensing by a non-transparent Schwarzschild star. 

To model a non-transparent star of radius r* one has to restrict the exterior Schwarzschild metric 
to the region r > r*. Lightlike geodesies terminate when they arrive at r = r*. The star's radius 
cannot be smaller than 2m unless it is allowed to be time-dependent. The qualitative features of 
lensing depend on whether r* is bigger than 3m. Stars with 2m < r* < 3m are called ultracom- 
pact [214] . Their existence is speculative. The lensing properties of an ultracompact star are the 
same as that of a Schwarzschild black hole of the same mass, for observer and light source in the 
region r > r*. In particular, the apparent angular radius S on the observer's sky of an ultracom- 
pact star is given by the escape cone of Figure [15] Also, an ultracompact star produces the same 
infinite sequence of images of each light source as a black hole. For r* > 3m, only finitely many of 
the images survive because the other lightlike geodesies are blocked. A non-transparent star has a 
finite focal length rf > 2m in the sense that parallel light from infinity is focused along a line that 
extends from radius value rt to infinity, rf depends on m and on r* . For the values of our Sun one 
finds rt — 550 au (1 au = 1 astronomical unit = average distance from the Earth to the Sun). An 
observer at r > rt can observe strong lensing effects of the Sun on distant light sources. The idea 
of sending a spacecraft to r > rt was occasionally discussed in the literature [424] 1301] 1407] . The 
lensing properties of a non-transparent Schwarzschild star have been illustrated by showing the 
appearance of the star's surface to a distant observer. For r* bigger than but of the same order of 
magnitude as 3m, this has relevance for neutron stars (see |436l 1324] 11601 15551 12751 IWT] ). r* may 
be chosen time-dependent, e.g., to model a non-transparent collapsing star. A star starting with 
r* > 2m cannot reach r = 2to in finite Schwarzschild coordinate time t (though in finite proper 
time of an observer at the star's surface), i.e., for a collapsing star one has r*(i) — >■ 2m for t ^ oo. 
To a distant observer, the total luminosity of a freely (geodesically) collapsing star is attenuated 
exponentially, L{t) oc exp (-t(3V3m)-i). This formula was first derived by Podurets [3481 with 
an incorrect factor 2 under the exponent and corrected by Ames and Thorne 0. Both papers 
are based on kinetic photon theory (Liouville's equation). An alternative derivation of the lumi- 
nosity formula, based on the optical scalars, was given by Dwivedi and Kantowski [110] . Ames 
and Thorne also calculated the spectral distribution of the radiation as a function of time and 
position on the apparent disk of the star. All these analyses considered radiation emitted at an 
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Figure 16: Lens map for the Schwarzschild metric. The observer is at ro — 5m, the hght sources 
are at rs = 10m. Q is the colatitude on the observer's sky and $(9) is the angle swept out by 
the ray (see Figure H]). $(9) was calculated with the help of Equation ([90)) . 9 is restricted by the 
opening angle 6 of the observer's escape cone (see Figure [T5t . Rays with Q ~ S asymptotically 
spiral towards the light sphere at r = 3m. The first diagram (cf. |150j . Figure 5) shows that 
$(9) ranges from to oo if 9 ranges from to 6. So there are infinitely many Einstein rings 
(dashed lines) whose angular radius approaches S. One can analytically prove |265l 13141 [53] that 
the divergence of $(9) for 9 — >■ (5 is logarithmic. This is true whenever light rays approach an 
unstable light sphere [15]. The second diagram shows $(9) over a logarithmic 9-axis. The graph 
of <I> approaches a straight line which was called the "strong- field limit" by Bozza et al. [S31 HE] . 
The picture illustrates that it is a good approximation for all light rays that make at least one 
full turn. The third diagram shows cos $(9) over a logarithmic 9-axis. For every source position 
< "i? < TT one can read the position of the images (dotted line). There are infinitely many, 
numbered by their order (l92]) that counts how often the light ray has crossed the axis. Images of 
odd order are on one side of the black hole, images of even order on the other. For the sources at 
•d = TT and i9 = one can read the positions of the Einstein rings. 
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Figure 17: Radial angular diameter distance £'+(8), tangential angular diameter distance -D_(9) 
and travel time T{Q) in the Schwarschild spacetime. The data are the same as in Figure 1161 
For the definition of £)+ and D- see Figure [51 D±{Q) can be calculated from $(0) with the 
help of Equation (|99l) and Equation (|100p . For the Schwarzschild case, the resulting formulas are 
due to jllO) (cf. |llll 1150] ). Zeros of D- indicate Einstein rings. If Z5+ and £>_ have different 
signs, the observer sees a side-inverted image. The travel time T{Q) {— Fermat arclength) can be 
calculated from Equation ([88]). One sees that, over the logarithmic 0-axis used here, the graph of 
T approaches a straight line. This illustrates that T{Q) diverges logarithmically if Q approaches 
its limiting value 5. This can be verified analytically and is characteristic of all cases where light 
rays approach an unstable light sphere [56] . 
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Figure 18: Luminosity distance l?ium(0) and ellipticity e{Q) (image distortion) in the 
Schwarzschild spacetime. The data are the same as in Figures 1161 and 1171 If point sources of 
equal bolometric luminosity are distributed at r = rs, the plotted function 2.51og]^Q (Dium(0)^) 
gives their magnitude on the observer's sky, modulo an additive constant rriQ. For the calculation 
of -Dium one needs and D- (see Figure [TT]), and the general relations (|4T|) and (|48)) . This proce- 
dure follows |110| (cf. |111[ 1150] ). For source and observer at large radius, related calculations can 
also be found in |265[ 13141 1254[ . Einstein rings have magnitude — oo in the ray-optical treat- 
ment. For a light source not on the axis, the image of order i -I- 2 is fainter than the image of order i 
by 2.5 log]^Q(e^'^) w 6.8 magnitudes, see (2651 1314] . (This is strictly true in the "strong-field limit", 
or "strong-deflection limit", which is explained in the caption of Figure [IB] 1 The above picture is 
similar to Figure 6 in |314) . Note that it refers to point sources and not to a radiating spherical 
surface r = rs of constant surface brightness; by Equation (|54p . the latter would show a constant 
intensity. The lower part of the diagram illustrates image distortion in terms of e = — 
Clearly, |e| is infinite at each Einstein ring. The double- logarithmic representation shows that 
beyond the second Einstein ring all images are extremely elongated in the tangential direction, 
|e| > 100. Image distortion in the Schwarzschild spacetime is also treated in |llll 11521 1151j . an 
approximation formula is derived in [308] . 
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angle < n/'i against the normal of the star as measured by a static (Killing) observer. Actually, 
one has to refer not to a static observer but to an observer comoving with the star's surface. This 
modification was worked out by Lake and Roeder |251) . An interesting approximation formula was 
derived by Beloborodov [33]. He showed that a light ray that is emitted at radius rs at an angle 
a with respect to the radial direction escapes to infinity at an angle V': approximately given by 
1 — cosa = (l — cos'0) (l — 2m/rs). As an application, he discusses the light bending of pulsars. An- 
other approximation formula for the bending angle was found by Mutka and Mahohnen [292| 1293] . 

Lensing by a transparent Schwarzschild star. 

To model a transparent star of radius i\ one has to join the exterior Schwarzschild metric at r = 
to an interior (e.g., perfect fluid) metric. Lightlike geodesies of the exterior Schwarzschild metric 
are to be joined to lightlike geodesies of the interior metric when they arrive at r = r*. The 
radius of the star can be time-independent only if > 2m. For 2m < < 3to {uUracompact 
star), the lensing properties for observer and light source in the region r > differ from the 
black hole case only by the possible occurrence of additional images, corresponding to light rays 
that pass through the star. Inside such a transparent ultracompact star, there is at least one 
stable photon sphere, in addition to the unstable one at r = 3m outside the star (cf. jlQl] ). In 
principle, there may be arbitrarily many photon spheres [227] . For > Sm, the lensing prop- 
erties depend on whether there are light rays trapped inside the star. For a perfect fluid with 
constant density, this is not the case; the resulting spacetime is then asymptotically simple, i.e., 
all inextendible light rays come from infinity and go to infinity. General results (see Section 13. 4|) 
imply that then the number of images must be finite and odd. The light cone in this exterior-plus- 
interior Schwarzschild spacetime is discussed in detail by Kling and Newman [236] . (In this paper 
the authors constantly refer to their interior metric as to a "dust" where obviously a perfect fluid 
with constant density is meant.) Effects on light rays issuing from the star's interior have been 
discussed already earlier by Lawrence [257] . The "escape cones", which are shown in Figure fTSl for 
the exterior Schwarzschild metric have been calculated by Jaffe '216' for points inside the star. The 
focal length of a transparent star with constant density is smaller than that of a non-transparent 
star of the same mass and radius. For the mass and the radius of our Sun, one finds 30 au for the 
transparent case, in contrast to the above-mentioned 550 au for the non-transparent case [301] . 
Radiation from a spherically symmetric homogeneous dust star that collapses to a black hole is 
calculated in [379], using kinetic theory. A collapsing inhomogeneous spherically symmetric dust 
configuration may form a naked singularity. Its visual appearance, and other observable features, 
are discussed in |109[ 11031 13061 1295) . This analysis was generalized from the dust case to more 
general matter models in [1691 1102] . 

Lensing by a Schwarzschild white hole. 

To get a Schwarzschild white hole one joins at r = 2m the static Schwarzschild region 2m < r < oo 
to the non-static Schwarzschild region < r < 2m at r = 2m in such a way that outgoing 
light rays can cross this surface but ingoing cannot. In analogy to the gravitational collapse of a 
spherically symmetric star into a black hole, one can consider the outburst of a white hole into 
a spherically symmetric star. The observable effects for an observer in the region r > 2m are 
discussed in [rin EM] |299l [Ml [249] [250] . 

5.2 Kottler spacetime 

The Kottler metric 
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is the unique spherically symmetric solution ol Einstein's vacuum field equation with a cosmological 
constant A. It has the form with 
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It is also known as the Schwarzschild-deSitter metric for A > and as the Schwarzschild-anti- 
deSitter metric for A < 0. The Kottler metric was found independently by Kottler |239j and by 
Weyl [431] . For A = 0, it reduces to the Schwarzschild metric (|105p . 

In the following we consider the Kottler metric with a constant m > and we ignore the region 
r < for which the singularity at r = is naked, for any value of A. For A < 0, there is one 
horizon at a radius rpj with < ru < 2to; the staticity condition e-^''"^ > is satisfied on the 
region r^i < r < oo. For < A < (3m)^^, there are two horizons at radii rm and rH2 with 
2m < rui < 3m < rH2; the staticity condition e^*-''^ > is satisfied on the region thi < r < rH2- 
For A > (3to)^^ there is no horizon and no static region. At the horizon(s), the Kottler metric 
can be analytically extended into non-static regions. For A < 0, the resulting global structure is 
similar to the Schwarzschild case. For < A < (3to)^^, the resulting global structure is more 
complex (see [248] ). The extreme case A = (3to)~^ is discussed in [347) . 

For any value of A, the Kottler metric has a light sphere at r = 3to. Escape cones and embedding 
diagrams for the Fermat geometry (optical geometry) can be found in [3911 1201] (cf. Figures [15] 
and[T2]for the Schwarzschild case). Similarly to the Schwarzschild spacetime, the Kottler spacetime 
can be joined to an interior perfect-fluid metric with constant density. Embedding diagrams for the 
Fermat geometry (optical geometry) of the exterior-plus-interior spacetime can be found in [393] . 
For the optical appearance of a Kottler black hole see [H], and for the optical appearance of a 
Kottler white hole see [249] . The shape of infinitesimally thin light bundles in the Kottler spacetime 
is determined in [111] . 

In view of gravitational lensing, the Kottler metric is of particular interest because it can be 
used to answer the question of how the bending angle of light is affected by a cosmological constant. 
To that end one has to consider the orbits of light rays in the Kottler spacetime and to investigate 
how they differ from the orbits of light rays in the Schwarzschild spacetime. The first person who 
looked into this question was, surprisingly late, Islam in 1983 |212] . He found that the bending 
angle of light is not affected at all by a cosmological constant. His conclusion, which eventually 
turned out to be erroneous, was based on the (correct) observation that the differential equation 
for the orbits of light rays in the Kottler spacetime is exactly the same as in the Schwarzschild 
spacetime. To verify this, it suffices to insert the metric coefficients S{r) and R{r) from Equation 
()114p into Eq. ([5^ . This results in the differential equation 



which is, indeed, independent of A. Hence, the orbits of light rays in the Kottler spacetime are 
given by exactly the same coordinate equations as in the Schwarzschild spacetime. On the basis of 
this result, it was generally accepted for more than two decades that the gravitational bending of 
light is unaffected by a cosmological constant. (See, however, Lake [247] for an interesting caveat, 
as to the question of whether the constant m has the same physical meaning in the Kottler case as 
in the Schwarzschild case.) Only in 2007 was it shown, in a paper by Rindler and Ishak [356] , that 
this conclusion was incorrect. The fact that the coordinate expressions for the orbits of light rays 
in the Kottler spacetime are the same as in the Schwarzschild spacetime does not imply that the 
bending angles are the same. The reason is that physically measured angles differ from coordinate 
angles; the physically measured angles involve the metric, and the metric does depend on A. The 
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analysis of Rindler and Ishak showed that, in contrast to earlier belief, a positive cosmological 
constant would have a diminishing effect on the bending angle. This is in perfect agreement with 
the intuitive idea that a positive cosmological constant has a repelling effect (i.e., that it tends 
to weaken the gravitational attraction). In terms of the Fermat metric (or optical metric), recall 
Equation (|70|) . the Rindler- Ishak result can be rephrased in the following way |177j : The Fermat 
metric of the Kottler spacetime is projectively equivalent to the Fermat metric of the Schwarzschild 
spacetime (i.e., the unparametrized geodesies are the same), but not conformally equivalent (i.e., 
angles are different). Sereno supported (and slightly modified) the results of Rindler and Ishak 
in two papers. In the first one [375] he analyzed the influence of a cosmological constant on the 
bending of light in the weak deflection limit. He found that, in the case of a positive A, the radius 
of an Einstein ring decreases and, in a multiple imaging situation, the images are demagnified 
and the time delay increases. In the second paper |376] he demonstrated that the influence of 
a cosmological constant on the lens equation can be partly (but not completely) absorbed by an 
appropriate redefinition of the angular diameter distance. He argued that, for physical reasons, 
one should express all results in terms of angular diameter distances, rather than in terms of the 
radial Schwarzschild-like coordinate, as in the Rindler-Ishak paper. In the same paper, Sereno also 
calculated the influence of a cosmological constant on the redshifts in a multiple imaging situation. 
Further contributions to the subject were made by Schiicker |369) . In contrast to Sereno, Schiicker 
deliberately avoided any reference to a lens equation; instead, he concentrated on the difference 
between coordinate angles and physical angles. 

The Rindler-Ishak paper has caused a fairly large number of follow-up papers. Although some 
of them were critical, it sems fair to say that, by now, it is generally accepted that a positive 
cosmological constant has a diminishing effect on the bending angle of light. However, there is 
still a controversy about the question of whether this effect is actually observable, in realistic 
astrophysical situations. In order to answer this question, it is not sufficient to analyze the light 
bending in the Kottler metric, which describes the gravitational field around an isolated mass in 
a world with a cosmologal constant. It is rather necessary to take the influence of a cosmological 
background spacetime into account. This has been done by applying the Einstein-Straus method 
with a cosmological constant, i.e., by matching a Kottler vacuole at an outer boundary to a 
Robertson- Walker spacetime. Calculations of light bending in such a composed spacetime were 
undertaken by Ishak et al. [210) . and then, e.g., by Schiicker ^370j . Whereas these papers come to 
the conclusion that the effect of A on the light bending by some galaxy clusters could be observable, 
some other authors feel that this effect is negligibly small, see e.g. [382) . For a recent review article 
on the topic the reader may consult Ishak and Rindler [209] . 



5.3 Reissner— Nordstrom spacetime 

The Reissner-Nordstrom metric 

5 = - (l - ^ + ^) + . + {d^^ + sin^ ^ V) (116) 

is the unique spherically symmetric and asymptotically flat solution of the Einstein-Maxwell equa- 
tions. It has the form (IM)) with 

e2/('-)=5(r)-i = l- — + 4, R{r)= , (117) 

yi - — + 72 

It describes the field around an isolated spherical object with mass m and charge e. The Reissner- 
Nordstrom metric was found independently by Reissner [3541 , Weyl |430j , and Nordstrom |309] . A 
fairly complete list of the pre-1979 literature on Reissner-Nordstrom geodesies can be found in 13801 . 



68 



A detailed account of Reissner-Nordstrom geodesies is given in |75j. (The Reissner-Nordstrom 
spacetime can be modified by introducing a cosmological constant. This generahzed Reissner- 
Nordstrom spacetime, whose global structure is investigated in [256] . will not be considered here.) 

We assume m > and ignore the region r < for which the singularity at r = is naked, for 
any value of e. Two cases are to be distinguished; 

1. < < m?; in this case the staticity condition e-^'-'"' > is satisfied on the regions 
< r < m — V — and m + < r < oo, i.e., there are two horizons. 

2. < e^; then the staticity condition e^*^*"' > is satisfied on the entire region < r < oo, 
i.e., there is no horizon and the singularity at r = is naked. 

By switching to isotropic coordinates, one can describe light propagation in the Reissner-Nordstrom 
metric by an index of refraction (see, e.g., [135J ). The resulting Fermat geometry (optical geometry) 
is discussed, in terms of embedding diagrams for the black-hole case and for the naked-singularity 
case, in |2441 13] (cf. [201]). The visual appearance of a background, as distorted by a Reissner- 
Nordstrom black hole, is calculated in [276]. Lensing by a charged neutron star, whose exterior 
is modeled by the Reissner-Nordstrom metric, is the subject of [HHISl]- The lensing properties 
of a Reissner-Nordstrom black hole are qualitatively (though not quantitatively) the same as that 
of a Schwarzschild black hole. The reason is the following. For a Reissner-Nordstrom black hole, 
the metric coefficient R{r) has one local minimum and no other extremum between horizon and 
infinity, just as in the Schwarzschild case (recall Figure llOp . The minimum of R{r) indicates 
an unstable light sphere towards which light rays can spiral asymptotically, thereby defining the 
"shadow" of a Reissner-Nordstrom black hole. The existence of this minimum, and of no other 
extremum, was responsible for all qualitative features of Schwarzschild lensing. Correspondingly, 
Figures [TBI flTl and 1181 also qualitatively illustrate lensing by a Reissner-Nordstrom black hole. In 
particular, there is an infinite sequence of images for each light source, corresponding to an infinite 
sequence of light rays whose limit curve asymptotically spirals towards the light sphere. One can 
consider the "strong-field limit" [53]|3S] of lensing for a Reissner-Nordstrom black hole, in analogy 
to the Schwarzschild case which is indicated by the asymptotic straight line in the middle graph 
of Figure UM Bozza investigates whether quantitative features of the "strong-field limit", e.g., 
the slope of the asymptotic straight line, can be used to distinguish between different black holes. 
For the Reissner-Nordstrom black hole, image positions and magnifications have been calculated 
in |126j . and travel times have been calculated in |255] . In both cases, the authors use the "almost 
exact lens map" of Virbhadra and Ellis |420| (recall Section 14. 3p and analytical methods of Bozza 
et al. [S3] I3H1 ■ The question of whether the "shadow" of a Reissner-Nordstrom black hole can 
be observationally distinguished from that of a Schwarzschild black hole is discussed in [437]. 

5.4 Morris— Thorne wormholes 

We consider a spacetime whose metric is of the form with e-^^-^^S{r) — 1, i.e., 

g = -e^f^^^df + dr^ + e^f^^'>R{rf {dd^ + sin^ d V) , (US) 
where r ranges from — oo to oo. We call such a spacetime a Morris-Thome wormhole (see [283] ) if 

f[r) ~> 0, r-^R{rf — > 1, (119) 

1 — ^±oo r— )-±oo 

such that the metric (|118p is asymptotically flat for r — > — oo and for r — > oo. 

A particular example of a Morris-Thorne wormhole is the Ellis wormhole |132| where 

f{r) = 0, R{r) = v^r2 + a? (120) 
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with a constant a. The EUis wormhole has an unstable hght sphere at r = 0, i.e., at the "neck" 
of the wormhole. It is easy to see that every Morris-Thorne wormhole must have an unstable 
light sphere at some radius between r = —oo and r = oo. This has the consequence [191] that 
every Morris-Thorne wormhole produces an infinite sequence of images of an appropriately placed 
light source. This infinite sequence corresponds to infinitely many light rays whose limit curve 
asymptotically spirals towards the unstable light sphere. 

Lensing by the Ellis wormhole was discussed in [77J : in this paper the authors identified the 
region r > with the region r < and they developed a scattering formalism, assuming that 
observer and light source are in the asymptotic region. Lensing by the Ellis wormhole was also 
discussed in [339] in terms of the exact lens map. The resulting features are qualitatively very 
similar to the Schwarzschild case, with the radius values r = — oo, r = 0, r = oo in the wormhole 
case corresponding to the radius values r — 2m, r = 3m, r = oo in the Schwarzschild case. With 
this correspondence, Figures [TCI II 7[ and [TH] qualitatively illustrate lensing by the Ellis wormhole. 
More generally, the same qualitative features occur whenever the metric function R{r) has one 
minimum and no other extrema, as in Figure 1101 Lensing by the Ellis wormhole (and other types 
of wormholes) is also discussed in [297] . For a detailed discussion of lensing by Morris-Thorne 
wormholes, including visualizations, see |287|I288] . 

If observer and light source are on the same side of the wormhole's neck, and if only light rays in 
the asymptotic region are considered, lensing by a wormhole can be studied in terms of the quasi- 
Newtonian approximation formalism |235j . However, as wormholes are typically associated with 
negative energy densities |283| 1284) , the usual assumption of the quasi- Newtonian approximation 
formalism that the mass density is positive cannot be maintained. This observation has raised some 
interest in lensing by negative masses, in particular in the question of whether negative masses can 
be detected by their ( "microlensing" ) effect on the energy fiux from sources passing behind them. 
So far, related calculations [86] l361j have been done only in the quasi-Newtonian approximation 
formalism. 

5.5 Barriola— Vilenkin monopole 

The Barriola- Vilenkin monopole [27] is given by the metric 

g = -df + dr'^ + k^r'^idd'^ + sin^ d dip'^), (121) 

with a constant fc < 1. There is a deficit solid angle and a singularity at r = 0; the plane 
t = constant, -d — 7t/2 has the geometry of a cone. (Similarly, for fc > 1 one gets a surplus solid 
angle.) The Einstein tensor has non- vanishing components Gtt = ~Grr = (1 — A:^)/r^. 

The metric \121\ was briefiy mentioned as an example for a conical singularity by Sokolov and 
Starobinsky 384 . Barriola and Vilenkin [27] realized that this metric can be used as a model for 
monopoles that might exist in the universe, resulting from breaking a global 0(3) symmetry. They 
also discussed the question of whether such monopoles could be detected by their lensing properties 
which were characterized on the basis of some approximative assumptions (cf. [108] ). However, 
such approximative assumptions are actually not necessary. The metric ()12ip has the nice property 
that the geodesies can be written explicitly in terms of elementary functions. This allows to write 
down explicit expressions for image positions and observables such as angular diameter distances, 
luminosity distances, image distortion, etc. (see [339] ). Note that because of the deficit angle the 
metric p2ip is not asymptotically flat in the usual sense. (It is "quasi-asymptotically flat" in the 
sense of [312] .) For this reason, the "almost exact lens map" of Virbhadra and Ellis [420] (see 
Section [4?3)) . is not applicable to this case, at least not without modiflcation. 

The metric ()12ip is closely related to the metric of a static string (see metric ()139p with 
a = 0). Restricting metric ()12ip to the hyperplane d ^ tt/2 and restricting metric (1139P with 
a = to the hyperplane z — constant gives the same (2 + l)-dimensional metric. Thus, studying 
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light rays in the equatorial plane of a Barriola-Vilenkin monopole is the same as studying light 
rays in a plane perpendicular to a static string. Hence, the multiple imaging properties of a 
Barriola-Vilenkin monopole can be deduced from the detailed discussion of the string example 
in Section 15.101 In particular Figures [25] and [26] show the light cone of a non-transparent and 
of a transparent monopole if we interpret the missing spatial dimension as circular rather than 
linear. This makes an important difference. While in the string case the cone of Figures 1251 has a 
2-dimensional set of transverse self-intersection points, the corresponding cone for the monopole 
has a 1-dimensional radial caustic. The difference is difficult to visualize in spacetime pictures; it 
is therefore recommendable to use a purely spatial visualization in terms of instantaneous wave 
fronts (intersections of the light cone with hypersurfaces t — constant) (compare Figures fTOl and [20] 
with Figures [28] and [29]) • 




Figure 19: Instantaneous wave fronts in the spacetime of a non-transparent Barriola-Vilenkin 
monopole with k = 0.8. The picture shows in 3-dimensional space the intersections of the past 
light cone of some event with four hypersurfaces t — constant, at values ti > t2 > > t^. Only 
one half of each instantaneous wave front and of the monopole is shown. When the wave front 
passes the monopole, a hole is pierced into it, then a tangential caustic develops. The caustic of 
each instantaneous wave front is a point, the caustic of the entire light cone is a spacelike curve in 
spacetime which projects to part of the axis in 3-space. 
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Figure 20: Instantaneous wave fronts in the spacetime of a transparent Barriola-Vilenkin nionopole 
with k — 0.8. In addition to the tangential caustic of Figure [T9l a radial caustic is formed. For 
each instantaneous wave front, the radial caustic is a cusp ridge. The radial caustic of the entire 
light cone is a lightlike 2-surface in spacetime which projects to a rotationally symmetric 2-surface 
in 3-space. 
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5.6 Janis— Newman— Winicour spacetime 



The Janis-Newman-Winicour metric [217, can be brought into the form [418] 

2to W 2 dr^ (d^^ + sin' ^ d'p') 

( 1 _ 2m ^ 



IT J I 1 — — 

1 7r 



(122) 



where m and 7 are constants. It is the most general sphericahy symmetric static and asymptotically 
flat solution of Einstein's field equation coupled to a massless scalar field. For 7 = 1 it reduces to 
the Schwarzschild solution; in this case the scalar field vanishes. For m > and 7 7^ 1, there is a 
naked curvature singularity at r = 2m/ ^. Lensing in this spacetime was studied in |4221 1421] 1417] , 
The main motivation was to find out whether the lensing characteristics of such a naked singularity 
can be distinguished from lensing by a Schwarschild black hole. The result is that the qualitative 
features of lensing remain similar to the Schwarzschild case as long asl/2<7<l. However, 
if 7 drops below 1/2, they become quite different. The reason is easily understood if we write 
Equation (|122p in the form (|69|) . The metric coefficient 

i?(r)=r(l-^)'" (123) 

has a minimum between the singularity and infinity as long as ^ < 7 < 1 (see Figure I21|) . This 
minimum indicates an unstable light sphere (recall Equation ([TT]) ). as in the Schwarzschild case at 
r = 3m. All qualitative features of lensing carry over from the Schwarzschild case, i.e.. Figures [T6l 
1171 and llSl remain qualitatively unchanged. Clearly, the precise shape of the graph of $ in Figure fTHl 
changes if 7 is changed. The question of how the logarithmic asymptote ( "strong- field limit") 
depends on 7 is dicussed in |48| . If 7 drops below 1/2, R{r) has no longer an extremum, i.e., 
there is no light sphere. Owing to a general result proven in |191j . this implies that only finitely 
many images are possible. In |421j naked singularities of spherically symmetric spacetimes are 
called weakly naked if they are surrounded by a light sphere (cf. [81J). In a nutshell, weakly naked 
singularities show the same qualitative lensing features as black holes. A generalization of this 
result to spacetimes without spherical symmetry has not been worked out so far. 



5.7 Boson and fermion stars 

Spherically symmetric static solutions of Einstein's field equation coupled to a scalar field may 
be interpreted as (uncharged, non-rotating) boson stars if they are free of singularities. Because 
of the latter condition, the Janis-Newman-Winicour metric (see Section 15. 6p does not describe a 
boson star. The theoretical concept of boson stars goes back to [229| I359j . The analogous idea 
of a fermion star, with the scalar field replaced by a spin 1/2 (neutrino) field, is even older |270j . 
Until today there is no observational evidence for the existence of either a boson or a fermion 
star. However, they are considered, e.g., as hypothetical candidates for supermassive objects at 
the center of galaxies (see |371[ 1405] for the boson and |416l 1406] for the fermion case) . For the 
supermassive object at the center of our own galaxy, evidence points towards a black hole, but the 
possibility that it is a boson or fermion star cannot be completely excluded so far. 

Exact solutions that describe boson or fermion stars have been found only numerically (in 
3-1-1 dimensions) . For this reason there is no boson star model for which the lightlike geodesies 
could be studied analytically. Numerical studies of lensing have been carried out by Dabrowski 
and Schunck [93, for a transparent spherically symmetric static maximal boson star, and by Bilic, 
Nikolic, and Viollier [JD] for a transparent spherically symmetric static maximal fermion star. For 
the case of a fermion- fermion star (two components) see |220j . In all three articles the authors use 
the "almost exact lens map" of Virbhadra and Ellis (see Section 14. 3p which is valid for observer 
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Figure 21: Metric coefficient R{r) for tfie Janis-Newman-Winicour metric. For -i < 7 < 1, R{r) is 
similar to the Schwarzschild case 7 = 1 (see Figure [TU]) . For 7 < 5, Rif) has no longer a minimum, 
i.e., there is no longer a light sphere which can be asymptotically approached by light rays. 
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and light source in the asymptotic region and almost aligned. D§,browski and Schunck |93j also 
discuss how the alignment assumption can be dropped. The lensing features found in [33] for 
the boson star and in }40j for the fermion star have several similarities. In both cases, there is a 
tangential caustic and a radial caustic (recall Figure |9] for terminology). A (point) source on the 
tangential caustic (i.e., on the axis of symmetry through the observer) is seen as a (1-dimenional) 
Einstein ring plus a (point) image in the center. If the (point) source is moved away from the 
axis the Einstein ring breaks into two (point) images, so there are three images altogether. Two 
of them merge and vanish if the radial caustic is crossed. So the qualitative lensing features are 
quite different from a Schwarzschild black hole with (theoretically) infinitely many images (see 
Section 15. ip . The essential difference is that in the case of a boson or fermion star there are no 
circular lightlikc geodesies towards which light rays could asymptotically spiral. 



5.8 Kerr spacetime 

Next to the Schwarzschild spacetime, the Kerr spacetime is the physically most relevant example 
of a spacetime in which lensing can be studied explicitly in terms of the lightlike geodesies. The 
Kerr metric is given in Boyer-Lindquist coordinates (r, i?, iy9, t) as 

/ rjf'^ \ '2,7717' 9 

g = g{r, ---- +(0)^] + (r^ + a^) sin^ d(p^ - df + , (a sin^ § dip - dt) , (124) 
\A(r) J g[r,v)'^ ' 

where q and A are defined by 

Q{r,'df = r^ + CO'S^ -d, A{r) ^ r'^ - 2mr + a"^ , (125) 

and m and a are two real constants. We assume < a < m, with the Schwarzschild case a = 
and the extreme Kerr case a = m as limits. Then the Kerr metric describes a rotating uncharged 
black hole of mass m and specific angular momentum a. (The case a > m, which describes a 
naked singularity, will be briefly considered at the end of this section.) The domain of outer 
communication is the region between the (outer) horizon at 

rj^ = m+ \/rn?—a? (126) 

and r = oo. It is joined to the region r < r+ in such a way that past-oriented ingoing lightlike 
geodesies cannot cross the horizon. Thus, for lensing by a Kerr black hole only the domain of outer 
communication is of interest unless one wants to study the case of an observer who has fallen into 
the black hole. The lensing properties of a Kerr black hole will be reviewed below. For the effect 
of a Kerr black hole on the propagation of the polarization plane of light (cf. Section 12. 5|) see, 
e.g., [T78]IM5l[T42l[m]l3Tnl[374] . 



Historical notes. 

The Kerr metric was found by Kerr |234| . The coordinate representation (I124p is due to Boyer and 
Lindquist ^7] . The literature on lightlike (and timelike) geodesies of the Kerr metric is abundant 
(for an overview of the pre-1979 literature, see Sharp |380j ). Detailed accounts on Kerr geodesies 
can be foimd in the books by Chandrasekhar [7S] and O'Neill |316j . 

Fermat geometry. 

The Killing vector held dt is not timelike on that part of the domain of outer communication where 
Q{r, df < 2mr. This region is known as the ergosphere. Thus, the general results of Section [4. 2 1 on 
conformally stationary spacetimes apply only to the region outside the ergosphere. On this region, 
the Kerr metric is of the form (I5T|) . with redshift potential 

e^^(-^) = 1 - (127) 
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Fermat metric 



e(r,i?)2-2mr VA(r) ) {g{r,^y-2mr 

and Fermat one-form 

^ 2mra sin^ 1? (129) 
p(r, i?)2 — 2mr 

(Equation (|128p corrects a misprint in [333) . Equation (66), where a square is missing.) With the 
Fermat geometry at hand, the optical-mechanical analogy (Fermat's principle versus Maupertuis' 
principle) allows to write the equation for light rays in the form of Newtonian mechanics (cf. [7]). 
Certain embedding diagrams for the Fermat geometry (optical reference geometry) of the equato- 
rial plane have been constructed |390l 1201] . However, they are less instructive than in the static 
case (recall Figure [T2|) because they do not represent the light rays as geodesies of a Riemannian 
manifold. 

First integrals for lightlike geodesies. 

Carter ,74, discovered that the geodesic equation in the Kerr metric admits another independent 
constant of motion K, in addition to the constants of motion L and E associated with the Killing 
vector fields dip and dt- This reduces the lightlike geodesic equation to the following first-order 
form: 

, , , , (r'^ + a'^) ((r'^ + a'^)E ~ aL) 

g(r,^)h = a(L-Easm^d) + - , 130 

^ ' LS.(r) 

, on2 ■ -Basinet? {r"^ + a^)aE - L 
Q{r. - -t- ^ T^73 , (131) 



sm 



•d A(r) 



,ir,,f^^ = K^ ^'^-^.rf'^\ (132) 
sm 17 

gir, i?)V2 = -KA{r) + ((r^ + a^)E - ahf . (133) 

Here an overdot denotes differentiation with respect to an affine parameter s. This set of equations 
allows writing the lightlike geodesies in terms of elliptic integrals [22]. Clearly, d and r may 
change sign along a ray; thus, the integration of Equation (|132p and Equation (|133|) must be 
done piecewise. The determination of the turning points where § and r change sign is crucial for 
numerical evaluation of these integrals and, thus, for ray tracing in the Kerr spacetime (see, e.g., 
|41H 1353] I138j ). With the help of Equations ()132l I133|) one easily verifies the following important 
fact (cf. |192j ). Through each point of the region 

/C : (2rA(r) ~ {r - m) g{r, dff < Aa^r^l:\{r) sir? d (134) 

there is spherical light ray, i.e., a light ray along which r is constant (see Figure[22|). These spherical 
light rays are unstable with respect to radial perturbations. For the spherical light ray at radius 
r-p the constants of motion E^ L, and K satisfy 

L T o 2rr,A(r,,) 
a-=rl + a^- ^ ^ ^' , 135 
E '■ r-p ~ m 

E^ (rp — m)'^ 

The region K. is the Kerr analogue of the "light sphere" r = 3m in the Schwarzschild spacetime. 
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TO 2m 3to 4to 5to 



Figure 22: The region /C, defined by Equation (|134|) . in the Kerr spacetime. This picture, which 
can also be found in [192) . is purely spatial and shows a meridional section ip = constant, with 
the axis of symmetry at the left-hand boundary. Through each point of /C there is a spherical 
geodesic. Along each of these spherical geodesies, the coordinate oscillates between extremal 
values, corresponding to boundary points of /C, whereas the coordinate ip proceeds according to 
Eq. (|13ip . The region /C meets the axis at radius r^, given by — Smr^ + a^r^ + ma? = 0. 
Its boundary intersects the equatorial plane in circles of radius (corotating circular light ray) 
and (counter- rotating circular light ray), are determined by r^{r^ — 3to)^ — Ama? and 
r+ < < 3m < < 4to. In the Schwarzschild limit a — !■ the region JC shrinks to the light 
sphere r = 3m. In the extreme Kerr limit a ^ m the region /C extends to the horizon because 
in this limit both r'^ — >■ m and r+ m. For a caveat, as to geometric misinterpretations of this 
limit, see Figure 3 in [22] , 
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Light cone. 

With the help of Equations (|1301 11311 1132[[T55|) . the past hght cone of any observation event po can 
be expHcitly written in terms of eUiptic integrals. In this representation the light rays are labeled 
by the constants of motion L/E and K/E'^. In accordance with the general idea of observational 
coordinates (jlj, it is desirable to relabel them by spherical coordinates (^',6) on the observer's 
celestial sphere. This requires choosing an orthonormal tetrad (eo, ei, 62, 63) at po- It is convenient 
to choose ei oc 9^, 62 oc 63 cx dr and, thus, eo perpendicular to the hypersurface t = constant 
( "zero-angular-momentum observer" ) . For an observation event in the equatorial plane, = 7''/2, 
at radius tq, one finds 



As in the Schwarzschild case, some light rays from pq go out to infinity and some go to the horizon. 
In the Schwarzschild case, the borderline between the two classes corresponds to light rays that 
asymptotically approach the light sphere at r = 3m. In the Kerr case, it corresponds to light rays 
that asymptotically approach a spherical light ray in the region K, of Figure [22l The constants 
of motion for such light rays are given by Equation (|135[ 11361) , with rp varying between its ex- 
tremal values and (see again Figure [22t . Thereupon, Equation (|137p and Equation (|138l) 
determine the celestial coordinates 5* and Q of those light rays that approach a spherical light 
ray in K.. The resulting curve on the observer's celestial sphere gives the apparent shape of the 
Kerr black hole (see Figure [23]). For an observation event on the axis of rotation, sini9o = 0, the 
Kerr light cone is rotationally symmetric. The caustic consists of infinitely many spacelike curves, 
as in the Schwarzschild case. A light source passing through a point of the caustic is seen as an 
Einstein ring. For observation events not on the axis, the light cone has no rotational symmetry 
and the caustic structure is quite different from the Schwarzschild case. The caustic still consists 
of infinitely many connected subsets (a primary caustic and infinitely many higher-order caustics), 
but these are no longer spacelike curves. This fact is somewhat disguised if one restricts to light 
rays in the equatorial plane d = tt/2 (which is possible, of course, only if the observation event 
is in the equatorial plane). Then the resulting 2-dimensional light cone looks indeed qualitatively 
similar to the Schwarzschild cone of Figure [13] (cf. |184j ). where intersections of the light cone with 
hypersurfaces t — constant are depicted. However, in the Kerr case the transverse self-intersection 
of this 2-dimensional light cone does not occur on an axis of symmetry. Therefore, the caustic of 
the full (3-dimensional) light cone is more involved than in the Schwarzschild case. The primary 
caustic turns out to be not a spatially straight line, as in the Schwarzschild case, but rather a tube, 
with astroid cross-section, that winds a certain number of times around the black hole; for a — m 
it approaches the horizon in an infinite spiral motion. The primary caustic of a Kerr light cone, 
with vertex in the equatorial plane far away from the black hole, was numerically calculated and 
depicted, for a = m, by Rauch and Blandford [353] . A detailed study of primary and higher-order 
caustics, for a Kerr light cone with vertex far away from the black hole but not necessarily in the 
equatorial plane, was presented by Bozza [51]. This work, which contains several pictures of Kerr 
caustics in 3-1-0 dimensions, is based on numerical calculations. The results are in good agreement 
with analytical approximation methods for studying the caustics. Two such methods exist which 
are complementary to each other in the sense that the first is valid for light rays that come close 
to a spherical light ray in the region /C and the second is valid for light rays that stay far away 
from the black hole: The first method is due to Bozza, de Luca, Scarpetta and Sereno [55) [57] 
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who analytically studied higher-order caustics in the strong deflection limit; this approach is not 
applicable to the primary caustic. The second method is due to Sereno and de Luca [377] who 
developed an analytic formula for the primary caustic that is valid up to fourth order in m/h and 
a/6, where h is the impact parameter. Taking all this together, a fairly clear picture of the caustics 
of Kerr light cones has now emerged. Also, attempts have been made to visualize the Kerr light 
cones in terms of their intersections with hypersurfaces t = constant, see Figure 1 in [182] ■ From 
the study of light cones one may switch to the study of arbitrary wave fronts. (For the definition 
of wave fronts see Section [221) Pretorius and Israel |351| determined all axisymmetric wave fronts 
in the Kerr geometry. In this class, they investigated in particular those members that are asymp- 
totic to Minkowski light cones at infinity ("quasi-spherical light cones") and they found, rather 
surprisingly, that they are free of caustics. Special families of wave fronts in the Kerr spacetime 
are also considered, e.g., in [145| I194[ [TT ] . 

Lensing by a Kerr black hole. 

For an observation event po and a light source with worldline 73 , both in the domain of outer com- 
munication of a Kerr black hole, several qualitative features of lensing are unchanged in comparison 
to the Schwarzschild case. If 73 is past-inextendible, bounded away from the horizon and from 
(past lightlike) infinity, and does not meet the caustic of the past light cone of po , the observer 
sees an infinite sequence of images; for this sequence, the travel time (e.g., in terms of the time 
coordinate t) goes to infinity. These statements have been proven in i l92 with the help of Morse 
theory (cf. Section [373]) . On the observer's sky the sequence of images approaches the apparent 
boundary of the black hole which is shown in Figure [23l This follows from the fact that 

• the infinite sequence of images must have an accumulation point on the observer's sky, by 
compactness, and 

• the lightlike geodesic with this initial direction cannot go to infinity or to the horizon, by 
assumption on 73. 

If 7s meets the caustic of po's past light cone, the image is not an Einstein ring, unless po is 
on the axis of rotation. It has only an "infinitesimal" angular extension on the observer's sky. 
As always when a point source meets the caustic, the ray-optical calculation gives an infinitely 
bright image. Numerical studies show that in the Kerr spacetime, where the caustic is a tube with 
astroid cross-section, the image is very bright whenever the light source is inside the tube [353] . 
In principle, with the lightlike geodesies given in terms of elliptic integrals, image positions on the 
observer's sky can be calculated explicitly. This has been worked out for several special wordlines 
7s. The case that 7s is a circular timelike geodesic in the equatorial plane of the extreme Kerr 
metric, a = m, was treated by Cunningham and Bardeen [90| 123] . This example is of relevance 
in view of accretion disks. Viergutz |411] developed a formalism for the case that 7s has constant 
r and 1? coordinates, i.e., for a light source that stays on a ring around the axis. One aim of 
this approach, which could easily be rewritten in terms of the exact lens map (recall Section 12. ip , 
was to provide a basis for numerical studies. The case of a stationary light source (i.e., the case 
that 7s is an integral curve of dt) was investigated in great detail in a series of papers by Bozza, 
de Luca, Scarpctta and Sereno [49l [55j [lU \57\ . In all these papers the authors derive analytic 
approximation formulas using the strong-deflection limit, i.e., the approximation is good for light 
rays that undergo a deflection of tt or more. Such light rays come close to one of the spherical 
light rays in the region /C, recall Figure [22| The flrst two papers in the series make the additional 
assumption that the light source and the observer are in the equatorial plane and that not only 
the observer but also the light source is far away from the black hole; in the last two papers these 
assumptions are relaxed. This series of papers gives a fairly complete analysis of Kerr lensing for 
stationary light sources under the strong deflection hypothesis. An alternative approach to Kerr 
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a = m/3 , ro = 3 m 



a = m/3 , ro = Qrn 



a = m/3 ro = 12 m 




a = m, To = irn a = m, ro = 6m a = m, tq = ^'irn 



Figure 23: Apparent shape of a Kerr black hole for an observer at radius ro in the equatorial 
plane. (For the Schwarzschild analogue, see Figure ITSl) The spin vector of the black hole is pointing 
downwards. The pictures show the celestial sphere of an observer whose 4- velocity is perpendicular 
to a hypersurface t — constant. (If the observer is moving one has to correct for aberration.) The 
dashed circle is the celestial equator, Q = 7r/2, and the crossing axes indicate the direction towards 
the center, = tt. Past-oriented light rays go to the horizon if their initial direction is in the 
black disk and to infinity otherwise. Thus the black disk, known as the "shadow" of the black 
hole, shows that part of the sky which is not illuminated by light sources at a large radius. The 
boundary of this disk corresponds to light rays that asymptotically approach a spherical light ray 
in the region JC of Figure [22] For an observer in the equatorial plane at infinity, the shadow of a 
Kerr black hole was correctly calculated and depicted by Bardeen [35] (cf. [7S], p. 358). Earlier 
work by Godfrey |178| contains a mathematical error. Observability of the shadow of a Kerr black 
hole is discussed, e.g., in [l37 | l438 ] [T99 ] . 
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lensing with stationary light sources, partly based on numerical results, was brought forward by 
Vazquez and Esteban |410) . All these articles also calculate the brightness of images. This requires 
determining the cross-section of infinitesimally thin bundles with a vertex, e.g., in terms of the 
shape parameters -D+ and D- (recall Figure |3]). For a bundle around an arbitrary light ray in 
the Kerr metric, all relevant equations were worked out analytically by Pineault and Roeder [345]. 
However, the equations are much more involved than for the Schwarzschild case and will not 
be given here. Lensing by a Kerr black hole has been visualized (i) by showing the apparent 
distortion of a background pattern |346l 1381] and (ii) by showing the visual appearence of an 
accretion disk |346l 13501 13811 131] . The main difference, in comparison to the Schwarzschild case, is 
in the loss of the left-right symmetry. In view of observations, Kerr black holes are considered as 
candidates for active galactic nuclei (AGN) since many years. In particular, the X-ray variability 
of AGN is interpreted as coming from a "hot spot" in an accretion disk that circles around a 
Kerr black hole. Starting with the pioneering work in |90[ 123] . many articles have been written on 
calculating the light curves and the spectrum of such "hot spots" , as seen by a distant observer (see, 
e.g., [98l [151 12251 12181 1138] ). The spectrum can be calculated in terms of a transfer function that 
was tabulated, for some values of a, in [88] (cf. |41H 1412] ). A Kerr black hole is also considered 
as the most likely candidate for the supermassive object at the center of our own galaxy. (For 
background material see |136l .) In this case, the predicted angular diameter of the black hole on 
our sky, in the sense of Figure [531 is about 30 microarcseconds; this is not too far from the reach 
of current VLBI technologies |137] . Also, the fact that the radio emission from our galactic center 
is linearly polarized gives a good motivation for calculating polarimetric images as produced by a 
Kerr black hole [64] . The calculation is based on the geometric-optics approximation according to 
which the polarization vector is parallel along the light ray. In the Kerr spacetime, this parallel- 
transport law can be explicitly written with the help of constants of motion [53] 13451 1395) (cf. [7S] , 
p. 358). As to the large number of numerical codes that have been written for calculating imaging 
properties of a Kerr black hole the reader may consult |226l I411| 13531 1138] . 

Notes on Kerr naked singularities. 

The Kerr metric with a > m describes a naked singularity. Until now there is no observational 
indication that such objects exist in nature. The lightlike geodesies in a Kerr spacetime with a > m 
have been studied in [BHIIZO] (cf. [7S], p. 375). Observable effects of accretion disks around a Kerr 
naked singularity, in comparison to a Kerr black hole, were discussed in |400] . The "shadow" of a 
Kerr naked singularity was calculated in [1001 1199] and, under different assumptions, in T9 . 

Notes on the Kerr— Newman spacetime. 

The Kerr-Newman spacetime (charged Kerr spacetime) is usually thought to be of little astro- 
physical relevance because the net charge of celestial bodies is small. For the lightlike geodesies in 
this spacetime the reader may consult [69 , 71 . Embeddability diagrams of the equatorial plane of 
a Kerr-Newman spacetime can be found in [392] . The shadow of a Kerr-Newman black hole, and 
of a Kerr-Newman naked singularity, was discussed in [100! 1399] . A Morse-theoretical analysis of 
lensing in the Kerr-Newman spacetime can be found in [192j . 

5.9 Rotating disk of dust 

The stationary axisymmetric spacetime around a rigidly rotating disk of dust was first studied in 
terms of a numerical solution to Einstein's field equation by Bardeen and Wagoner ^24l [25] . The 
exact solution was found much later by Neugebauer and Meinel [303] . It is discussed, e.g., in [302] . 
The metric cannot be written in terms of elementary functions because it involves the solution 
to an ultraelliptic integral equation. It depends on a parameter /z which varies between zero and 
/ic — 4.62966 .... For small ji one gets the Newtonian approximation, for /x — >■ /ic the extreme 
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Kerr metric (a — m) is approached. The lightlike geodesies in this spacetime have been studied 
numerically and the appearance of the disk to a distant observer has been visualized |429| . It would 
be desirable to support these numerical results with exact statements. From the known properties 
of the metric, only a few qualitative lensing features of the disk can be deduced. As Minkowski 
spacetime is approached for /i 0, the spacetime must be asymptotically simple and empty as 
long as n is sufficiently small. (This is true, of course, only if the disk is treated as transparent.) 
The general results of Section [33] imply that in this case the gravitational field of the disk produces 
finitely many images of each light source, and that the number of images is odd, provided that 
the worldline of the light source is past-inextendible and does not go out to past lightlike infinity. 
For larger values of fx, this is no longer true. For /i > 0.5 there are two counter-rotating circular 
lightlike geodesies in the equatorial plane, a stable one at a radius pi inside the disk and an unstable 
one at a radius p2 outside the disk. (This follows from 13 where it is shown that for fi > 0.5 
timelike counter-rotating circular geodesies do not exist in a radius interval [pi , P2] • The boundary 
values of this interval give the radii of lightlike circular geodesies.) The existence of circular light 
rays has the consequence that the number of images must be infinite; this is obviously true if light 
source and observer are exactly on the spatial track of such a circular light ray and, by continuity, 
also in a neighborhood. For a better understanding of lensing by the disk of dust it is desirable 
to investigate, for each value of p and each event po^ Which past-oriented lightlike geodesies that 
issue from po go out to infinity and which are trapped? Also, it is desirable to study the light 
cones and their caustics. 

5.10 Straight spinning string 

Cosmic strings (and other topological defects) are expected to exist in the universe, resulting from 
a phase transition in the early universe (see, e.g., |415l for a detailed account). So far, there is 
no direct observational evidence for the existence of strings. In principle, they could be detected 
by their lensing effect. The general perspective is discussed in f207 1 11641 1282] . The object CSL-1, 
which consists of a pair of galaxies, was discussed as a candidate for lensing by a string for some 
time [364] . However, more recent observations by the Hubble Space Telescope led to the conclusion 
that it is not a lensed image [363]. 

Basic lensing features for various string configurations are briefly summarized in p^. Here we 
consider the simple case of a straight string that is isolated from all other masses. This is one 
of the most attractive examples for investigating lensing from the spacetime perspective without 
approximations. In particular, studying the light cones in this metric is an instructive exercise. 
The geodesic equation is completely integrable, and the geodesies can even be written explicitly in 
terms of elementary functions. 

We consider the spacetime metric 

g = -{dt -adf 'f + dz^ + dp^ + k^p^ dip^ , (139) 

with constants a and fc > 0. As usual, the azimuthal coordinate is defined modulo 2tt. For a — Q 
and fc = 1, metric (|139p is the Minkowski metric in cylindrical coordinates. For any other values of 
a and fc, the metric is still (locally) flat but not globally isometric to Minkowski spacetime; there 
is a singularity along the z-axis. For a = and < fc < 1, the plane t — constant, z = constant 
has the geometry of a cone with a deficit angle 

(5 = (1 - fc)27r. (140) 

(see Figure [24]) : for fc > 1 there is a surplus angle. Note that restricting the metric ()139[) with 
a = to the hyperplane z = constant gives the same result as restricting the metric (|12ip of the 
Barriola-Vilenkin monopole to the hyperplane d = 7r/2. 
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The metric (|139p describes the spacetime around a straight spinning string. The constant k is 
related to the string's mass-per-length /i, in Planck units, via 



fc = 1 - 



(141) 



whereas the constant a is a measure for the string's spin. Equation (jl41l) shows that we have 
to restrict to the deficit-angle case fc < 1 to have fi positive. One may treat the string as a line 
singularity, i.e., consider the metric (|139p for all p > 0. (This "wire approximation", where the 
energy-momentum tensor of the string is concentrated on a 2-dimensional submanifold, is math- 
ematically delicate; see [167] .) For a string of finite radius one has to match the metric (|139p 
at p = to an interior solution, thereby getting a metric that is regular on all of R^. In view of 
lensing it is important to distinguish between a transparent string, where light rays are allowed to 
pass through the interior solution, and a non-transparent string, where light rays are blocked at 
the boundary of the string. 



Figure 24: On a cone with deficit angle < S < w, the point p can be connected to every point q 
in the double- imaging region (shaded) by two geodesies and to a point in the single-imaging region 
(non-shaded) by one geodesic. 

Historical notes. 

With a = 0, the metric (|139p and its geodesies were first studied by Marder [267| 1268] , He 
also discussed the matching to an interior solution, without, however, associating it with strings 
(which were no issue at that time). The same metric was investigated by Sokolov and Starobin- 
sky |384j as an example for a conic singularity. Later Vilenkin ^413, ,414i showed that within the 
linearized Einstein theory the metric (|139p with a — describes the spacetime outside a straight 
non-spinning string. Hiscock |200) . Gott |180) . and Linet f261' realized that the same is true in 
the full (non-linear) Einstein theory. Basic features of lensing by a non-spinning string were found 
by Vilenkin [414] and Gott |180j . The matching to an interior solution for a spinning string, 
a 7^ 0, was worked out by Jensen and Soleng [219] . Already earlier, the restriction of the met- 
ric (|139|) with a 7^ to the hyperplane z = was studied as the spacetime of a spinning particle 
in 2 -I- 1 dimensions by Deser, Jackiw, and 't Hooft [lOlj . The geodesies in this (2 -|- l)-dimensional 
metric were first investigated by Clement [82] (cf. Krori, Goswami, and Das [245] for the (3-1- 1)- 
dimensional case). For geodesies in string metrics one may also consult Galtsov and Masar [162] , 
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The metric (|139|) can be generalized to the case of several parallel strings (see Letelier |259j for 
the non-spinning case, and Krori, Goswami, and Das [245] for the spinning case). Clarke, Ellis 
and Vickers [80 found obstructions against embedding a string model close to metric (|139p into 
an almost-Robertson-Walker spacetime. This is a caveat, indicating that the lensing properties of 
"real" cosmic strings might be significantly different from the lensing properties of the metric (|139|) . 

Redshift and Fermat geometry. 

The string metric (I139P is stationary, so the results of Section l¥^ applv. Comparison of metric (|139p 
with metric (|6T|) shows that the redshift potential vanishes, / = 0. Hence, observers on i-lines see 
each other without redshift. The Fermat metric g and Fermat one- form (p read 

g = dz^ + dp^ + Pp'^dip^, (142) 
$ = ~ad(p. (143) 

As the Fermat one-form is closed, d<j) ~ 0, the spatial paths of light rays are the geodesies of the 
Fermat metric g (cf. Equation (|64p ). i.e., they are not affected by the spin of the string, cj) can be 
transformed to zero by changing from t to the new time function t — aip. Then the influence of 
the string's spin on the travel time (j62l) vanishes as well. However, the new time function is not 
globally well-behaved (if a 7^ 0), because ip is either discontinuous or multi- valued on any region 
that contains a full circle around the z-axis. As a consequence, (f) can be transformed to zero on 
every region that does not contain a full circle around the z-axis, but not globally. This may be 
viewed as a gravitational analogue of the Aharonov-Bohm effect (cf. |385j ). The Fermat met- 
ric (|142p describes the product of a cone with the z-line. Its geodesies (spatial paths of light rays) 
are straight lines if we cut the cone open and flatten it out into a plane (see Figure IM)) . The metric 
of a cone is (locally) flat but not (globally) Euclidean. This gives rise to another analogue of the 
Aharonov-Bohm effect, to be distinguished from the one mentioned above, which was discussed, 
e.g., in [T461 [381 1195]. 

Light cone. 

For the metric (jl39l) , the lightlike geodesies can be explicitly written in terms of elementary func- 
tions. One just has to apply the coordinate transformation (t, ip) 1 — > {t — a(p, kip) to the lightlike 
geodesies in Minkowski spacetime. As indicated above, the new coordinates are not globally well- 
behaved on the entire spacetime. However, they can be chosen as continuous and single-valued 
functions of the affine parameter s along all lightlike geodesies through some chosen event, with tp 
taking values in M. In this way we get the following representation of the lightlike geodesies that 
issue from the observation event {p — po, p ~ 0, z = 0,t — 0) into the past: 



p[s) = Y sin^ 8 -I- 2spo sin 8 cos * + (144) 

, s sin8 sin^ , 

tan (kpis)) = — — -, 145 

Po + s sm 8 cos W 

z(s) = s cos 8, (146) 
t{s) = -s + ap{s). (147) 

The affine parameter s coincides with g-arclength and (5*, 8) parametrize the observer's celestial 
sphere, 

''cos 5* sin 8^ 

sin* sin8 I . (148) 



^ jpis) cos p{s) 
p{s) sin p{s) 
zis) 



ds 



s=0 



cos 8 



Equations (|144[ 11451 11461 1147P give us the light cone parametrized by (s, 8, 4*). The same equa- 
tions determine the intersection of the light cone with any timelike hypersurface (source surface) 
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and thereby the exact lens map in the sense of Frittelh and Newman |154j (reeah Section \2.1^ . 
For k = 0.8 and a — 0, the hght cone is depicted in Figure [551 intersections of the hght cone 
with hypersurfaces t — constant ( "instantaneous wave fronts" ) are shown in Figure 1281 In both 
pictures we consider a non-transparent string of finite radius p,, i.e., the hght rays terminate if 
they meet the boundary of the string. Figures [26] and [29| show how the hght cone is modified if 
the string is transparent. This requires matching the metric (|139|) to an interior solution which is 
everywhere regular and letting light rays pass through the interior. For the non-transparent string, 
the light cone cannot form a caustic, because the metric is fiat. For the transparent string, light 
rays that pass through the interior of the string do form a caustic. The special form of the interior 
metric is not relevant. The caustic has the same features for all interior metrics that monotonously 
interpolate between a regular axis and the boundary of the string. Also, there is no qualitative 
change of the light cone for a spinning string as long as the spin a is small. Large values of a, 
however, change the picture drastically. For > k^p^, where p» is the radius of the string, the 
iy9-lines become timelike on a neighborhood of the string. As the iy9-lines are closed, this indicates 
causality violation. In this causality-violating region the hypersurfaces t = constant are not ev- 
erywhere spacelike and, in particular, not transverse to all lightlike geodesies. Thus, our notion of 
instantaneous wave fronts becomes pathological. 




cut locus 



Figure 25: Past light cone of an event po in the spacetime of a non-transparent string of finite 
radius p* with k = 0.8 and a = 0. The metric (|139|) is considered on the region p > p^,, and the 
light rays are cut if they meet the boundary of this region. The z coordinate is not shown, the 
vertical coordinate is time t. The "chimney" indicates the region p < p^ which is occupied by the 
string. The light cone has no caustic but a transverse self- intersection (cut locus). The cut locus, 
in the (2 -I- l)-dimensional picture represented as a curve, is actually a 2-dimensional spacelike 
submanifold. When passing through the cut locus, the lightlike geodesies leave the boundary of 
the chronological past I~{po). Note that the light cone is not a closed subset of the spacetime. 
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caustic 



Figure 26: Past liglit cone of an event po in the spacetime of a transparent string of finite radius 
with k — 0.8 and a = 0. The metric (I139P is matched at p = to an interior metric, 
and light rays are allowed to pass through the interior region. The perspective is analogous to 
Figure The light rays which were blocked by the string in the non-transparent case now form 
a caustic. In the (2 + l)-dimensional picture the caustic consists of two lightlike curves that meet 
in a swallow-tail point (see Figure [27] for a close-up). Taking the z-dimension into account, the 
caustic actually consists of two lightlike 2-manifolds (fold surfaces) that meet in a spacelike curve 
(cusp ridge). The third picture in Figure [2] shows the situation projected to 3-space. Each of the 
past-oriented lightlike geodesies that form the caustic first passes through the cut locus (transverse 
self- intersection), then smoothly slips over one of the fold surfaces. The fold surfaces are inside the 
chronological past I^{po), the cusp ridge is on its boundary. 
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Figure 27: Close-up of the caustic of Figure [251 Tlie string is not shown. Taking the z-dimension 
into account, the swahow-tail point is actually a spacelike curve (cusp ridge). 
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Figure 28: Instantaneous wave fronts in the spacetime of a non-transparent string of finite radius 
with fc = 0.8 and a = 0. The picture shows in S-dimensional space the intersections of the hght cone 
of Figure E5] with three hypersurfaces t = constant, at values ii > ^2 > ^3- The vertical coordinate 
is the z-coordinate which was suppressed in Figure [IHl Only one half of each instantaneous wave 
front is shown so that one can look into its interior. There is a transverse self-intersection (cut 
locus) but no caustic. 
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t = h 




Figure 29: Instantaneous wave fronts in the spacetime of a transparent string of finite radius 
with fc = 0.8 and a = 0. The picture is related to Figure as Figure [25] is related to Figure 
Instantaneous wave fronts that have passed through the string have a caustic, consisting of two 
cusp ridges that meet in a swallow-tail point. This caustic is stable (see Section [2T2|) . The caustic 
of the light cone in Figure [26] is the union of the caustics of its instantaneous wave fronts. It 
consists of two fold surfaces that meet in a cusp ridge, like in the third picture of Figure [2] 
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Lensing by a non-transparent string. 

With the lightlilce geodesies known in terms of elementary functions, positions and properties of 
images can be explicitly determined without approximation. We place the observation event at 
p = po, (fi = 0, z = 0, t = 0, and we consider a light source whose worldline is a t-Vme at p = ps, 
(f = (fis, z = ZS with < (fis < T^- From Equations (|1441 1145| I146P we find that the images are in 
one-to-one correspondence with integers n such that 

|<y9S + 2n7r| < 7r/fc. (149) 

They can be numbered by the winding number n in the order n = 0,— 1,1,— 2, 2,... The total 
number of images depends on k. Let Ni{k) be the largest integer and N2{k) be the smallest integer 
such that Ni{k) < 1/k < N2{k). Of the two integers Ni{k) and N2{k), denote the odd one by 
■^odd(fc) and the even one by Ncvcn{k). Then we find from Equation (|149|) 

0<ips < I^Vovcn - l/fc| TT : A^odd(fc) images, (150) 
l^cvon - TT < cps <TT ■■ A^cvcn(fc) images. (151) 

Thus, the number of images is even in a wedge-shaped region behind the string and odd everywhere 
else. If the light source approaches the boundary between the two regions, one image vanishes 
behind the string (see Figure [U] for the case 1 < 1/A; < 2). (If the non-transparent string has 
finite thickness, there is also a region with no image at all, in the "shadow" of the string.) The 
coordinates (^'„,0„) on the observer's sky of an image with winding number n and the affine 
parameter s„ at which the light source is met can be determined from Equations ()144l [T45l I146|) . 
We just have to insert p{s) = ps, ^{s) — + 2m:, z{s) = zs and to solve for tan^f — tan5'„, 
tan6 = tan0„, s = s„: 

. ,T, PS sin(fc(ys + 2n7r)) 

tan*„ = — — — , (152) 

PS cos (fc(i^s + 2mT)) - Po 

tane„ ^ VpI+p'o-^PsPo cos (fc(^s + 2n7r)) ^ ^^^^^ 

ZS 



Sn = \Jzl+ pI + pI- 2pspo COS (fc(.^s + 2n7r)). (154) 
The travel time follows from Equation (|147|) : 

Tn = Sn - a{ips + 2mr). (155) 

It is the only relevant quantity that depends on the string's spin a. With the observer on a i-line, 
the aflfine parameter s coincides with the area distance, I?aroa(s) — s, because in the (locally) flat 
string spacetime the focusing equation (j44p reduces to -Darea = 0. For observer and light source on 
t-lines, the redshift vanishes, so s also coincides with the luminosity distance, I?ium(s) = s, owing 
to the general law (j48|). Hence, Equation (|154|) gives us the brightness of images (see Section \2M 
for the relevant formulas). The string metric produces no image distortion because the curvature 
tensor (and thus, the Weyl tensor) vanishes (recall Section . Realistic string models yield a 
mass density p that is smaller than 10^^. So, by Equation (|141l) . only the case A'odd(fc) — 1 and 
Nevcn{k) = 2 is thought to be of astrophysical relevance. In that case we have a single-imaging 
region, < (^s < 27r — n/k, and a double-imaging region, 2tt — ir/k < (ps < (see Figure [24|) . 
The occurrence of double-imaging and of single imaging can also be read from Figure 1251 In 
the double- imaging region we have a ("primary") image with n = and a ("secondary") image 
with n = —1. From Equations (|1531 1154p we read that the two images have different latitudes 
and different brightnesses. However, for k close to 1 the difference is small. If we express k by 
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Equation (|140|) and linearize Equations (|1521ll53lll54M155p with respect to the deficit angle (|140|) . 
we find 



*o = 7 — - — r^r- (156) 

Ps + Po (Ps + Po)27r 

(157) 
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Hence, in this approximation the two images have the same 0— coordinate; their angular distance 
A on the sky is given by Vilenkin's |414j formula 

^^Ps^sinOo^ (161) 
Ps + Po 

and is thus independent of (ps; they have equal brightness and their time delay is given by the 
string's spin a via Equation (|160p . All these results apply to the case that the worldlines of the 
observer and of the light source are t-lines. Otherwise redshift factors must be added. 



Lensing by a transparent string. 

In comparison to a non-transparent string, a transparent string produces additional images. These 
additional images correspond to light rays that pass through the string. We consider the case a = 
and 1 < 1/ k < 2, which is illustrated by Figures [25] and [26l The general features do not depend 
on the form of the interior metric, as long as it monotonously interpolates between a regular axis 
and the boundary of the string. In the non-transparent case, there is a single-imaging region and 
a double-imaging region. In the transparent case, the double-imaging region becomes a triple- 
imaging region. The additional image corresponds to a light ray that passes through the interior 
of the string and then smoothly slips over one of the cusp ridges. The point where this light ray 
meets the worldline of the light source is on the sheet of the light cone between the two cusp ridges 
in Figure [26| i.e., on the sheet that does not exist in the non-transparent case of Figure [25] From 
the picture it is obvious that the additional image shows the light source at a younger age than the 
other two images (so it is a "tertiary image"). A light source whose worldline meets the caustic 
of the observer's past light cone is on the borderline between single-imaging and triple-imaging. 
In this case the tertiary image coincides with the secondary image and it is particularly bright 
(even infinitely bright according to the ray-optical treatment; recall Section [2?6|) . Under a small 
perturbation of the worldline the bright image either splits into two or vanishes, so one is left either 
with three images or with one image. 



5.11 Plane gravitational waves 

A plane gravitational wave is a spacetime with metric 

g = -2dudv- {f{u){x^ - y^) + 2g{u)xy) du^ -f dx^ + dy^ , (162) 

where f{uf' + .g(u)^ is not identically zero. For any choice of f{u) and g{u), the metric (|162|) 
has vanishing Ricci tensor, i.e., Einstein's vacuum field equation is satisfied. The lightlike vector 
field dy is covariantly constant. Non-flat spacetimes with a covariantly constant lightlike vector 
field are called plane-fronted waves with parallel rays or pp-waves for short. They made their first 
appearance in a purely mathematical study by Brinkmann |61] . 
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In spite of their high ideahzation, plane gravitational waves are interesting mathematical models 
for studying the lensing effect of gravitational waves. In particular, the focusing effect of plane 
gravitational waves on light rays can be studied quite explicitly, without any weak-field or small- 
angle approximations. This focusing effect is reflected by an interesting light cone structure. 

The basic features with relevance to lensing can be summarized in the following way. If the 
profile functions / and g are differentiable, and the coordinates {x,y,u,v) range over R^, the 
spacetime with the metric (|162|) is geodesically complete |118l . With the exception of the integral 
curves of dy, all inextendible lightlike geodesies contain a pair of conjugate points. Let q be the first 
conjugate point along a past-oriented lightlike geodesic from an observation event po- Then the 
first caustic of the past light cone of po is a parabola through q. (It depends on the profile functions 
/ and g whether or not there are more caustics, i.e., second, third, etc. conjugate points.) This 
parabola is completely contained in a hyperplane u = constant. All light rays through po, with 
the exception of the integral curve of dy, pass through this parabola. In other words, the entire 
sky oipo, with the exception of one point, is focused into a curve (see Figure [30)) . This astigmatic 
focusing effect of plane gravitational waves was discovered by Penrose [327] who worked out the 
details for "sufficiently weak sandwich waves" . (The name "sandwich wave" refers to the case that 
f{u) and g{u) are different from zero only in a finite interval ui < u < U2.) Full proofs of the 
above statements, for arbitrary profile functions / and g, were given by Ehrlich and Emch |120[|121] 
(cf. [22], Chapter 13). The latter authors also demonstrate that plane gravitational wave spacetimes 
are causally continuous but not causally simple. This strengthens Penrose's observation [327] that 
they are not globally hyperbolic. (For the hierarchy of causality notions see [32].) The generators 
of the light cone leave the boundary of the chronological past I~{po) when they reach the caustic. 
Thus, the above-mentioned parabola is also the cut locus of the past light cone. By the general 
results of Section 12.81 the occurrence of a cut locus implies that there is multiple imaging in the 
plane-wave spacetime. The number of images depends on the profile functions. We may choose 
the profile functions such that there is no second caustic. (The "sufficiently weak sandwich waves" 
considered by Penrose [327] are of this kind.) Then Figure [30] demonstrates that an appropriately 
placed worldline (close to the caustic) intersects the past light cone exactly twice, so there is 
double-imaging. Thus, the plane waves demonstrate that the number of images need not be odd, 
even in the case of a geodesically complete spacetime with trivial topology. 

The geodesic and causal structure of plane gravitational waves and, more generally, of pp- waves 
is also studied in |205l[72] . 

One often considers profile functions / and g with Dirac-delta-like singularities ("impulsive 
gravitational waves"). Then a mathematically rigorous treatment of the geodesic equation, and of 
the geodesic deviation equation, is delicate because it involves operations on distributions which 
are not obviously well-defined. For a detailed mathematical study of this situation see [3871 1246] . 

Garfinkle [163] discovered an interesting example for a pp-wave which is singular on a 2- 
dimensional worldsheet. This exact solution of Einstein's vacuum field equation can be interpreted 
as a wave that travels along a cosmic string. Lensing in this spacetime was numerically discussed 
by VoUick and Unruh [i^ . 

The vast majority of work on lensing by gravitational waves is done in the weak-field approxi- 
mation. Both for the exact treatment and for the weak-field approximation one may use Kovner's 
version of Fermat's principle (see Section [23]) . which has the advantage that it allows for time- 
dependent situations. Applications of this principle to gravitational waves have been worked out 
in the original article by Kovner |240) and by Faraoni |139[ 1140] . 
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V — u 



Figure 30: Past light cone of an event po in the spacetime (I162p of a plane gravitational wave. 
The picture was produced with profile functions /(u) > and g{u) — 0. Then there is focusing 
in the x-direction and dcfocusing in the y-direction. In the (2 + l)-diinensional picture, with the 
j/-coordinate not shown, the past light cone is completely refocused into a single point q, with the 
exception of one generator A. It depends on the profile functions whether there is a second, third, 
and so on, caustic. In any case, the generators leave the boundary of the chronological past I^{po) 
when they pass through the first caustic. Taking the y-coordinate into account, the first caustic 
is not a point but a parabola ( "astigmatic focusing" ) (see Figure I3ip . An electromagnetic plane 
wave (vanishing Weyl tensor rather than vanishing Ricci tensor) can refocus a light cone, with the 
exception of one generator, even into a point in 3 + 1 dimensions ( "anastigmatic focusing") (cf. 
Penrose |327| where a hand-drawing similar to the picture above can be found). 
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Figure 31: "Small wave fronts" of the light cone in the spacetinie ()162p of a plane gravitational 
wave. The picture shows the intersection of the light cone of Figure[3D]with the lightlike hyperplane 
u = constant for three different values of the constant: (a) exactly at the caustic (parabola), (b) 
at a larger value of u (hyperbolic paraboloid), and (c) at a smaller value of u (elliptic paraboloid). 
In each case, the hyperplane u = constant does not intersect the one generator A tangent to dy] 
all other generators are intersected transversely and exactly once. 
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